Subject Matter Expert
I worked on large-scale data annotation projects focused on natural language processing (NLP) and linguistic evaluation for both Swahili and English datasets, covering tasks such as text classification, sentiment analysis, intent recognition, named entity recognition (NER), and content moderation. I managed the full annotation lifecycle by reviewing guidelines, accurately labeling and validating data, conducting QA reviews on peer work, and flagging edge cases to improve guidelines. The projects involved thousands of data points per batch and were completed in collaboration with distributed teams. To maintain high standards, we followed strict quality control processes, including inter-annotator agreement checks, random QA audits, continuous feedback and retraining, and consistency checks against guidelines, ensuring high accuracy, cultural relevance, and linguistic correctness across all deliverables.