Data Engineering & AI Data Processing Volunteer
Developed and optimized data preprocessing pipelines for AI dataset creation focused on Ghana’s National Science & Maths Quiz. Processed and structured large-scale historical video datasets from YouTube for efficient extraction and annotation. Contributed to AI-ready dataset creation guided by best practices in supervised learning and data annotation.
• Automated data cleaning and transformation to minimize manual effort and enhance reproducibility.
• Improved dataset quality and usability, directly supporting model training.
• Collaborated with remote teams using GitHub, Colab, and Slack for version control and workflow streamlining.
• Ensured consistent documentation and reporting throughout the data processing life cycle.