PDF Table Extraction & OCR

Extract tables from PDF documents, including scanned pages processed with OCR. Handles merged cells, multi-page tables, and complex layouts. Exports to CSV, JSON, or Excel with column type detection and data cleaning.

Input: PDF file (text-based or scanned)
Output: CSV, JSON, or Excel (.xlsx)
Runtime: Python 3.10+ (pymupdf, pytesseract)
Reliability: 91%
Usage: programmatically via API or CLI
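To make "column type detection and data cleaning" concrete, here is a minimal sketch of what such a post-processing step could look like once a table has been extracted as rows of raw strings. It is an illustration only, not the skill's actual implementation: the function names (clean_cell, infer_column_type, rows_to_records, records_to_csv) are hypothetical, the type rules are deliberately simple (int, float, or string), and it assumes a rectangular table whose first row is the header.

```python
import csv
import io
import json


def clean_cell(cell):
    """Collapse runs of whitespace and trim the cell; None becomes ''."""
    if cell is None:
        return ""
    return " ".join(str(cell).split())


def _to_number(value, caster):
    # Thousands separators are common in extracted tables; drop them first.
    return caster(value.replace(",", ""))


def infer_column_type(values):
    """Return 'int', 'float', or 'str' for a column of cleaned strings."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "str"
    for name, caster in (("int", int), ("float", float)):
        try:
            for v in non_empty:
                _to_number(v, caster)
        except ValueError:
            continue  # at least one value does not fit this type
        return name
    return "str"


def convert(value, type_name):
    """Convert one cleaned cell to its column type; empty cells become None."""
    if value == "":
        return None
    if type_name == "int":
        return _to_number(value, int)
    if type_name == "float":
        return _to_number(value, float)
    return value


def rows_to_records(rows):
    """Turn [header, row, row, ...] into a list of typed dicts."""
    header = [clean_cell(c) for c in rows[0]]
    body = [[clean_cell(c) for c in row] for row in rows[1:]]
    types = [
        infer_column_type([row[i] for row in body]) for i in range(len(header))
    ]
    return [
        {name: convert(row[i], types[i]) for i, name in enumerate(header)}
        for row in body
    ]


def records_to_csv(records):
    """Serialize typed records to CSV text (None is written as an empty cell)."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=list(records[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


if __name__ == "__main__":
    # Hypothetical rows, shaped like what a table extractor might return.
    raw = [["Item ", "Qty", "Price"], ["Widget", "1,200", "3.50"]]
    records = rows_to_records(raw)
    print(json.dumps(records, indent=2))
    print(records_to_csv(records))
```

Inferring the type per column rather than per cell keeps each exported column homogeneous, which matters for the CSV and Excel outputs; a single non-numeric value makes the whole column fall back to strings.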

Install via Dejavu

1. Install the Dejavu MCP package:

pip install dejavu-mcp

2. Subscribe at keepingtrack.biz/skills-landing ($6.67/month)

3. Install this skill:

dejavu skill install pdf-table-extraction-ocr

4. Your AI agent can now use it:

dejavu skill execute pdf-table-extraction-ocr --input '...'
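For programmatic use, the execute command above can be wrapped in a small Python helper. This is a hedged sketch: the helper names (build_command, run_skill) are hypothetical, the format of the --input value is not documented here and is passed through unchanged, and the shape of the CLI's output is unknown, so the raw stdout is returned as text.

```python
import subprocess

SKILL_NAME = "pdf-table-extraction-ocr"


def build_command(input_payload):
    """Build the argv list for `dejavu skill execute`.

    `input_payload` is passed verbatim as the --input value; its expected
    format is not specified on this page.
    """
    return ["dejavu", "skill", "execute", SKILL_NAME, "--input", input_payload]


def run_skill(input_payload, timeout=300):
    """Run the skill and return its raw stdout as a string.

    Raises subprocess.CalledProcessError on a non-zero exit code and
    subprocess.TimeoutExpired if the call exceeds `timeout` seconds.
    """
    result = subprocess.run(
        build_command(input_payload),
        capture_output=True,
        text=True,
        timeout=timeout,
        check=True,
    )
    return result.stdout
```

Passing the command as a list rather than a shell string avoids quoting problems when the input payload contains spaces or quotes.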

Pricing

The $6.67/month subscription includes 1,000 free credits. This skill uses credits per execution.
