Overview
Extracts any data you need from PDF documents using a visual annotation workflow. You highlight the areas of interest directly on the document, optionally place column guides, and click Convert. The widget does the rest and sends a clean, structured table to your workflow.
Best For
PDFs exported from accounting software, ERP systems, or government portalsMulti-page reports — extract any data you need from any page, whether it is a formal table or a structured layout with names, IDs, amounts, or any other fieldsAny document format — the app adapts to your data regardless of how it is laid out on the pageRecurring reports with the same layout — annotate once, reuse every time
How to Use
Click the Open File button in the toolbar to load your PDF. Use the page controls in the toolbar to move between pages.
Select the Rectangle tool in the toolbar. Click and drag over any area of the document that contains the data you are interested in — this tells the widget where to look. You can draw multiple areas and assign each a different color. Each color acts as a label you can use later to filter and work with specific sections independently in other widgets such as Filter Builder.
Select the Guide tool and click to place vertical lines where you want column boundaries to be. This is optional — if you skip it, the widget estimates column positions automatically. It does this independently for each page, so it adapts even when column widths vary.
Click the Convert button. The widget processes your highlighted areas and sends a structured table to the output. Connect it to a Data Table to review your results, or pipe it directly to the next step in your workflow.
InputPDF File
Multi-page PDFs are fully supported — navigate between pages using the toolbar controlsThe PDF must have a text layer — documents that are purely scanned images require OCR processing before useVery large documents may take a moment to load — only the current page is rendered at a time
OutputData Table
Multiple highlighted areas on the same page are merged into one output tableYour annotations and settings are saved with the workflow file — reopen and click Convert again without redrawingThe output is available immediately after Convert — connect it to any downstream widget
Toolbar Reference
Hand ToolWhen toggled — click and drag anywhere on the canvas to pan across the document.
Selection ToolClick an existing annotation to select, reposition, or resize it.
Page NavigationStep through pages. The input shows your current page out of the total — you can also type a page number directly.
Keyboard Shortcuts
H
Toggle Hand toolV
Toggle Selection toolG
Toggle Guide tool (column delimiter)Ctrl↵ Enter
ConvertCtrlO
Open PDFCtrl+
Zoom inCtrl−
Zoom outCtrl0
Fit page to windowCtrlScroll ↑
Zoom inCtrlScroll ↓
Zoom outTips & Notes
Reuse your annotations
If you receive the same report layout every month, save the workflow. Your annotations are stored — just swap the PDF file and click Convert again.
Use colors to filter sections later
Each highlighted area gets a color. In downstream widgets like Filter Builder, you can use these colors to work with specific sections of the extracted data independently.
Column alignment issues
If columns appear merged in the output, try adding column guides between them. If rows are merged, your highlighted area may be too tall — trim it to cover only the rows you need.
Image-only PDFs produce no output
If you open a scanned document and get empty results, the file has no text layer. Run it through OCR software first, then open it in this widget.