Extract Tables from PDF to CSV/Excel — Lasso, Detect, and Export
Our Table Extractor turns PDF tables into usable data entirely in your browser. Lasso any region, auto-detect tables from text layout or grid lines, and fine-tune columns with manual guides. Export clean CSV or Excel workbooks without uploading files to external servers.
Key Features
Lasso Selection
Draw one or multiple rectangles over PDF pages to target tables precisely. Shift+drag to add multiple regions and delete to remove.
Auto Table Detection
Two modes: Text Layout clusters text into rows/columns; Grid Lines finds ruled tables in scans using line density.
Manual Column Guides
Add and drag vertical split lines inside a selection for pixel-accurate columns—ideal for merged or irregular cells.
Smart Tolerances
Adjust row and column tolerance (px) to handle misaligned content, tight spacing, and multi-line cells.
CSV & Excel Export
Copy CSV to clipboard or download CSV/XLSX. Export all regions into a single Excel file—each region becomes its own sheet.
Privacy First
Runs 100% locally using PDF.js and SheetJS. No uploads, tracking, or server processing—your documents stay on your device.
How to Use the Tool
- Open PDF: Drag & drop your PDF or click to choose a file.
- Navigate & Zoom: Move between pages and set a comfortable zoom level.
- Select Table(s): Lasso the table region. Use Auto-detect for quick suggestions.
- Refine Columns: Toggle Guides to add/drag vertical splits or tune row/column tolerances.
- Preview & Export: Click Extract to preview, then copy/download CSV or export XLSX (single or multi-sheet).
Supported Operations
Detection Modes
Text Layout: Best for digital PDFs with embedded text—clusters lines and columns automatically.
Grid Lines: Best for scanned/ruled tables—detects dense horizontal/vertical lines to propose a table area.
Manual Column Guides
Add precise vertical dividers inside the selection. Great for merged headers, uneven spacing, or multi-table areas.
Export Options
- CSV: Choose comma, semicolon, or tab delimiter; copy or download.
- XLSX: Export the current table or all regions (each as a separate worksheet).
Technical Details
Built on robust web tech for reliability and speed:
- PDF.js: High-fidelity PDF rendering and text extraction
- Canvas Overlay: Interactive lasso + guide editing
- SheetJS: Fast CSV/XLSX generation in-browser
- Local Fallbacks: Optional vendor paths if a CDN is unavailable
- Responsive UI: Works smoothly on desktops and modern tablets
Use Cases
Financial Statements
Extract balance sheets, income statements, and KPI tables into spreadsheets for analysis.
Research & Reports
Lift tables from academic papers, government datasets, and whitepapers without retyping.
Invoices & Receipts
Capture line items into CSV/XLSX for bookkeeping and reconciliation.
Compliance & Audit
Extract structured tables from disclosures and filings for review and auditing.
Privacy & Security
This tool is privacy-first by design:
- No server uploads—processing happens in memory, in your browser
- No data collection or tracking
- Temporary processing with automatic cleanup
- Optional local library fallbacks for offline-friendly reliability
Browser Compatibility
Works with modern browsers: Chrome, Edge, Firefox, and Safari. JavaScript must be enabled.