Multilingual Text Annotation for AI NLP models
The project involved annotating 15,000+ social media responses in English to create a high quality dataset for training NLP models in sentiment analysis, intent detection, and named entity recognition. The goal was to help AiI systems understand customer opinions, detect intents, and extract relevant entities across languages and cultural contexts. I specifically performed data labeling tasks like annotated posts for sentiment: positive, negative, neutral. Categorized text for intent: informational, complaint, praise or transactional. I also performed Named Entity Recognition to identify people, organizations, locations and products. I also handled multilingual content, ensuring annotations reflected language nuances, slang, and cultural context. Total annotated data was over 15,000 social media responses in English and Swahili for a period of 2 months. I achieved 95% Annotation accuracy through cross review and verification processes to deliver high quality dataset for AI NLP training