Prepare, clean, and augment datasets for ML training. Covers deduplication, quality filtering, format conversion, train/test splitting, data augmentation with configurable strategies, and bias detection. Input: raw dataset + requirements. Output: clean dataset + split. Python 3.11+. 95% reliable. Use programmatically via API or CLI.
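To make the deduplication, quality filtering, and train/test split steps concrete, here is a minimal standard-library sketch. It is not the skill's actual implementation: the function names, the whitespace/case normalization, and the `min_words` threshold are all assumptions chosen for illustration.

```python
import hashlib
import random


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants hash identically."""
    return " ".join(text.lower().split())


def deduplicate(records: list[str]) -> list[str]:
    """Keep the first occurrence of each record, compared after normalization."""
    seen: set[str] = set()
    unique: list[str] = []
    for record in records:
        digest = hashlib.sha256(normalize(record).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(record)
    return unique


def quality_filter(records: list[str], min_words: int = 3) -> list[str]:
    """Drop records with fewer than min_words words (hypothetical threshold)."""
    return [r for r in records if len(r.split()) >= min_words]


def train_test_split(
    records: list[str], test_fraction: float = 0.2, seed: int = 0
) -> tuple[list[str], list[str]]:
    """Shuffle a copy with a fixed seed, then cut off the test portion."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


raw = [
    "The cat sat on the mat.",
    "the  cat sat on the mat.",  # duplicate after normalization
    "Too short",                 # fails the word-count filter
    "Dogs chase balls in the park.",
    "Birds sing at dawn every day.",
    "Fish swim in the deep sea.",
    "Horses run across open fields.",
]

clean = quality_filter(deduplicate(raw))
train, test = train_test_split(clean, test_fraction=0.2, seed=0)
print(len(clean), len(train), len(test))  # 5 4 1
```

Hashing the normalized text keeps memory use flat for long records, and the fixed seed makes the split reproducible across runs.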
pip install dejavu-mcp
dejavu skill install dataset-preparation
dejavu skill execute dataset-preparation --input '...'
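For programmatic use, one option is to wrap the CLI call shown above in a small Python helper. This is a sketch under assumptions: the helper names are hypothetical, the input payload is passed through as an opaque string because its format is not specified here, and it is assumed that the CLI writes its result to stdout and signals failure with a non-zero exit code.

```python
import subprocess


def build_execute_command(skill: str, input_payload: str) -> list[str]:
    """Build the argv for `dejavu skill execute <skill> --input <payload>`."""
    return ["dejavu", "skill", "execute", skill, "--input", input_payload]


def run_skill(skill: str, input_payload: str) -> str:
    """Run the CLI and return its stdout.

    Assumes results go to stdout; raises CalledProcessError on a non-zero exit.
    """
    result = subprocess.run(
        build_execute_command(skill, input_payload),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list rather than a shell string avoids quoting problems when the payload contains spaces or quotes.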
$6.67/month subscription includes 1,000 free credits. This skill uses credits per execution.