AI-Powered Dataset Preparation Tool Developer
Built an open-source, interactive dataset preparation tool that combines rule-based analysis with an AI agent to detect and fix common dataset issues. The tool lets users preview fixes and build reusable, exportable pipelines that automate data cleaning tasks. Incorporated both traditional static validation and AI-driven recommendations to ensure high-quality datasets for machine learning models.
• Automated the detection and correction of missing values, outliers, and formatting errors.
• Enhanced reliability by validating AI agent suggestions against rule-based checks.
• Supported user-driven review of all labeling and cleaning steps before export.
• Streamlined the process of preparing datasets for downstream AI training workflows.
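
The validation of AI agent suggestions against rule-based checks could look roughly like the sketch below. It is illustrative only and is not taken from the tool itself: the `Suggestion` class, the `passes_rules` and `preview_fixes` functions, and the range-based rule are all hypothetical names and assumptions. A suggested fix is applied to a preview copy only if it targets a cell that is actually missing and proposes a value inside an allowed range, so the original data stays untouched until the user reviews the result.

```python
from dataclasses import dataclass


@dataclass
class Suggestion:
    """A hypothetical AI-proposed fix: set rows[row][column] to new_value."""
    row: int
    column: str
    new_value: float


def passes_rules(s, rows, lo, hi):
    """Rule-based check: the target cell must be missing and the value in [lo, hi]."""
    cell_is_missing = rows[s.row].get(s.column) is None
    return cell_is_missing and lo <= s.new_value <= hi


def preview_fixes(rows, suggestions, lo, hi):
    """Apply accepted suggestions to a copy of the data; return (preview, rejected)."""
    preview = [dict(r) for r in rows]  # copy each row so the input is not modified
    rejected = []
    for s in suggestions:
        if passes_rules(s, rows, lo, hi):
            preview[s.row][s.column] = s.new_value
        else:
            rejected.append(s)
    return preview, rejected


rows = [{"age": 34}, {"age": None}, {"age": None}]
suggestions = [
    Suggestion(row=1, column="age", new_value=36.0),  # missing cell, plausible value
    Suggestion(row=2, column="age", new_value=-5.0),  # missing cell, out of range
    Suggestion(row=0, column="age", new_value=40.0),  # cell is not missing
]
preview, rejected = preview_fixes(rows, suggestions, lo=0, hi=120)
```

Here only the first suggestion is applied to `preview`. The other two land in `rejected`, where a reviewer could inspect them before anything is exported.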