Data Labeling Consultant
Supported multiple AI research and development projects focused on improving natural language understanding, speech recognition, and multilingual model training. Projects spanned linguistic data curation, annotation process design, and dataset quality evaluation. Performed manual and semi-automated annotation of text, speech, and phonetic data for supervised machine learning and NLP research. Tagged and categorized datasets using Label Studio, Prodigy, and proprietary annotation tools. Created and maintained metadata schemas Conducted phonetic transcription, entity labeling, sentiment tagging, and intent classification tasks. Participated in multilingual labeling involving English, Spanish, and Yoruba corpora, ensuring cross-lingual consistency. Contributed to datasets ranging from 50,000 to over 1 million labeled instances, depending on project scope. Adhered to precision, recall, and inter-annotator agreement (IAA) benchmarks, maintaining >95% labeling accuracy.