What do you mean when you say "vector geometry"? Are you using the geometry extr...

aakashprasad91 · 2025-12-11T16:11:03 1765469463

Great question. By “vector geometry” we mean we’re using the underlying CAD-style vector data embedded in many PDFs (lines, arcs, polylines, hatches, etc.), not just raster images. We reconstruct objects and regions from that geometry, then fuse it with OCR (for annotations, tags, labels) and a detection model that operates on rendered tiles. The detector + OCR tells us what something is; the vector layer tells us exactly where and how it’s shaped so we can run dimension/clearance and cross-sheet checks reliably.

djprice1 · 2025-12-11T16:36:14 1765470974

Woah! What determines if something is an object at that vector level? I've done some light PDF investigations before and the whole PDF spec is super intimidating. Seems insane that you can understand which things are objects in the actual drawing at the PDF vector level

knollimar · 2025-12-13T01:54:31 1765590871

Mamy of the drawings in pdf space have some layer data from CAD/revit attached to them that might make it easier to cluster objects

aakashprasad91 · 2025-12-15T22:42:38 1765838558

Yep, exactly, when layer data survives the PDF export, it’s a huge help. We use it as a weak signal for clustering and object grouping, but never rely on it fully since it’s often inconsistent or stripped. When it’s there, accuracy and speed both improve noticeably.