PDF Table Extraction & OCR

Extract tables from PDF documents including scanned images with OCR. Handles merged cells, multi-page tables, and complex layouts. Exports to CSV, JSON, or Excel with column type detection and data cleaning.. Input: PDF file (text or scanned). Output: CSV, JSON, Excel (.xlsx). Python 3.10+ (pymupdf, pytesseract). 91% reliable. Use programmatically via API or CLI.

Install via Dejavu

1 Install the Dejavu MCP package:
pip install dejavu-mcp
2 Subscribe at keepingtrack.biz/skills-landing ($6.67/month)
3 Install this skill:
dejavu skill install pdf-table-extraction-ocr
4 Your AI agent can now use it:
dejavu skill execute pdf-table-extraction-ocr --input '...'

Pricing

$6.67/month subscription includes 1,000 free credits. This skill uses credits per execution. Subscribe now →

Get Dejavu — $6.67/month