AI Data Trainer
For the Swahili Sentiment Analysis Dataset project, I led the curation and detailed annotation of a large-scale Swahili text corpus. I developed specialized annotation guidelines to accurately capture sentiment, linguistic nuance, and cultural context. The resulting dataset was instrumental in improving the sentiment classification accuracy for Swahili-language machine learning models.
• Led annotation efforts focused on emotion and sentiment recognition within Swahili text data.
• Authored comprehensive guidelines to standardize team annotation practices.
• Significantly enhanced AI model understanding of Swahili-language user feedback.
• Used Labelbox and Prodigy for all data annotation and curation activities.