How to Extract a Table From a PDF to CSV
You have a PDF report with a multi-page table — sales data, transaction lists, scientific results. You need it in CSV for analysis. Copy-paste turns it into mush. PDFShed extracts it cleanly.
The problem
A PDF report has a 50-row sales table. You need to load it into Excel for filtering. Copy-pasting the table from the PDF gives you whitespace-separated text that Excel can't parse.
Use the tool now
Open the pdf to excel tool and follow the steps below.
Step-by-step
- 1
Open the PDF to Excel tool
Drop your PDF in. Tables are auto-detected.
- 2
Pick which tables to export
If the PDF has multiple tables, the tool lists them. Pick the one you want.
- 3
Choose CSV or XLSX output
CSV for raw analytics tools (Python, R, Tableau). XLSX for Excel/Sheets users who want formatting preserved.
- 4
Download and verify
Open in Excel — columns should be properly separated, rows aligned with the original.
- 5
Clean up if needed
Header row may merge or split awkwardly on complex tables. Manual cleanup of the first row is usually fast.
Pro tips
- •Works best on text-based PDFs. For scanned tables, run [OCR PDF](/en/tools/ocr-pdf) first.
- •Multi-page tables with continuing headers usually merge cleanly. Tables that change column count mid-document don't.
- •For very complex tables (cells with line breaks, merged cells), expect some manual cleanup. The tool gets you 90% of the way.
- •Save the extracted CSV alongside the PDF — versions of the PDF may have updated numbers.
Frequently asked questions
Will column headers be detected?
Usually yes — the first row of the detected table is exported as headers. For complex tables with multi-row headers, manual review is needed.
Can I extract from a scanned PDF?
Yes, after OCR. Run [OCR PDF](/en/tools/ocr-pdf) first to convert scanned tables to text, then PDF→Excel.
What about tables that span multiple pages?
Auto-merge handles continuous tables that just continue page-to-page. For tables that restart on each page, use [Merge PDF](/en/tools/merge-pdf) consideration when extracting.
Will currency symbols and number formatting come through?
Numeric values come through as raw numbers. Currency symbols are often dropped during conversion (they're re-applicable in Excel via cell formatting).