Speech Recognition Data Annotation for Multilingual ASR Models
Contributed to the development of IBM Watson’s Speech-to-Text API by annotating and evaluating multilingual speech datasets. Tasks included phonetic segmentation, utterance classification, and error evaluation to enhance automatic speech recognition (ASR) performance. Collaborated with NLP engineers to refine pronunciation modeling, prosody analysis, and data normalization. The project involved labeling over 100,000 audio segments in English and related dialects. Maintained rigorous QA standards and consistency checks to ensure 98%+ annotation accuracy.