DocIntel — Document Intelligence
Route: /docintel
Purpose: Import BIM files and project documents into the Purple8 Graph automatically
9.1 Supported Formats
| Format | Content Extracted |
|---|---|
| IFC | Full geometry, elements, properties, relationships, spaces |
| Text, tables (via OCR), drawing annotations | |
| DWG/DXF | 2D geometry, layers, blocks (converted to elements) |
| DOCX | Specifications, reports — extracted as document nodes |
| XLSX | Room schedules, cost plans, door/window schedules |
| S3 | Any of the above, fetched from AWS S3 bucket |
| SharePoint | Documents from SharePoint libraries (Office 365) |
| Confluence | Wiki pages and attachments from Confluence |
9.2 The Pipeline
Upload / Connect
↓
QUEUED — job is in the queue
↓
PARSING — file read, format detected, structure extracted
↓
EXTRACTING — elements classified, properties normalised, relationships mapped
↓
EMITTING — graph nodes created, edges wired, LOD assigned
↓
DONE — available in BIM Viewport9.3 Uploading a File
- Click Browse or drag a file into the upload zone
- Select the file type if not auto-detected
- Optionally set a Project ID to scope the elements
- Click Upload
- The document appears in the pipeline list with status QUEUED
- Refresh after a few seconds — status updates automatically
9.4 Connecting Cloud Sources
For SharePoint, S3, or Confluence:
- Click the Connectors tab in the upload panel
- Select connector type
- Enter the required credentials (URL, API key, etc.)
- Browse and select the document or folder
- Click Connect — the same pipeline runs asynchronously
9.5 After Import
Once DONE:
- All elements appear in BIM Viewport under your project
- You can run Algorithms on the imported data
- The AI Chat can query the imported elements
- Compliance checks run on the imported floor plans
9.6 IFC Import Tips
- Use IFC2x3 or IFC4 format — IFC4X3 is supported but experimental
- Ensure IfcProject and IfcBuilding are defined at the top level
- IfcSpace elements map to
spacenodes; IfcWall towallnodes, etc. - Properties in property sets (Pset_*) are extracted as node properties
- Large IFC files (>50MB) may take 2-5 minutes to fully process