Swahili Sentiment Analysis Dataset (Final Year Project)
Constructed a sentiment analysis dataset of over 12,000 Swahili tweets for use in AI training. Designed sentiment labels (positive, negative, neutral) according to a fixed schema and led a team of annotators in applying them. Ensured high quality through multi-stage annotation checks and documented procedures.
• Established clear annotation guidelines with edge-case resolution
• Oversaw annotation consensus processes among team members
• Ensured the dataset package included a codebook, guidelines, and quality metrics
• Used Python tools for data cleaning and export