Extract Tables from PDF to CSV/Excel — Lasso, Detect, and Export

Our Table Extractor turns PDF tables into usable data entirely in your browser. Lasso any region, auto-detect tables from text layout or grid lines, and fine-tune columns with manual guides. Export clean CSV or Excel workbooks without uploading files to external servers.

Key Features

Lasso Selection

Draw one or multiple rectangles over PDF pages to target tables precisely. Shift+drag to add multiple regions and delete to remove.

Auto Table Detection

Two modes: Text Layout clusters text into rows/columns; Grid Lines finds ruled tables in scans using line density.

Manual Column Guides

Add and drag vertical split lines inside a selection for pixel-accurate columns—ideal for merged or irregular cells.

Smart Tolerances

Adjust row and column tolerance (px) to handle misaligned content, tight spacing, and multi-line cells.

CSV & Excel Export

Copy CSV to clipboard or download CSV/XLSX. Export all regions into a single Excel file—each region becomes its own sheet.

Privacy First

Runs 100% locally using PDF.js and SheetJS. No uploads, tracking, or server processing—your documents stay on your device.

How to Use the Tool

  1. Open PDF: Drag & drop your PDF or click to choose a file.
  2. Navigate & Zoom: Move between pages and set a comfortable zoom level.
  3. Select Table(s): Lasso the table region. Use Auto-detect for quick suggestions.
  4. Refine Columns: Toggle Guides to add/drag vertical splits or tune row/column tolerances.
  5. Preview & Export: Click Extract to preview, then copy/download CSV or export XLSX (single or multi-sheet).

Supported Operations

Detection Modes

Text Layout: Best for digital PDFs with embedded text—clusters lines and columns automatically.

Grid Lines: Best for scanned/ruled tables—detects dense horizontal/vertical lines to propose a table area.

Manual Column Guides

Add precise vertical dividers inside the selection. Great for merged headers, uneven spacing, or multi-table areas.

Export Options

  • CSV: Choose comma, semicolon, or tab delimiter; copy or download.
  • XLSX: Export the current table or all regions (each as a separate worksheet).

Technical Details

Built on robust web tech for reliability and speed:

  • PDF.js: High-fidelity PDF rendering and text extraction
  • Canvas Overlay: Interactive lasso + guide editing
  • SheetJS: Fast CSV/XLSX generation in-browser
  • Local Fallbacks: Optional vendor paths if a CDN is unavailable
  • Responsive UI: Works smoothly on desktops and modern tablets

Use Cases

Financial Statements

Extract balance sheets, income statements, and KPI tables into spreadsheets for analysis.

Research & Reports

Lift tables from academic papers, government datasets, and whitepapers without retyping.

Invoices & Receipts

Capture line items into CSV/XLSX for bookkeeping and reconciliation.

Compliance & Audit

Extract structured tables from disclosures and filings for review and auditing.

Privacy & Security

This tool is privacy-first by design:

  • No server uploads—processing happens in memory, in your browser
  • No data collection or tracking
  • Temporary processing with automatic cleanup
  • Optional local library fallbacks for offline-friendly reliability

Browser Compatibility

Works with modern browsers: Chrome, Edge, Firefox, and Safari. JavaScript must be enabled.