AI Training & Data Labeling Specialist
As an AI Training & Data Labeling Specialist, I annotated and labeled Swahili-language audio datasets for machine learning model development. I applied complex annotation guidelines to various East African dialects, ensuring accurate phonetic alignment, speaker diarization, and context-specific labeling. My work directly contributed to culturally nuanced, production-ready datasets used for speech recognition and NLP pipelines. • Implemented structured quality rubrics to maintain high labeling accuracy across multilingual and code-switched audio content. • Identified and resolved ambiguous segments, enhancing dataset reliability for regional AI deployment. • Collaborated closely with AI/ML engineering teams to refine schemas and resolve edge cases for low-resource languages. • Ensured data security, confidentiality, and version control while handling sensitive audio assets.