Extract tables from PDF documents including scanned images with OCR. Handles merged cells, multi-page tables, and complex layouts. Exports to CSV, JSON, or Excel with column type detection and data cleaning.. Input: PDF file (text or scanned). Output: CSV, JSON, Excel (.xlsx). Python 3.10+ (pymupdf, pytesseract). 91% reliable. Use programmatically via API or CLI.
pip install dejavu-mcp
dejavu skill install pdf-table-extraction-ocr
dejavu skill execute pdf-table-extraction-ocr --input '...'
$6.67/month subscription includes 1,000 free credits. This skill uses credits per execution. Subscribe now →