AI Training & Multi-Modal Data Labeling Project (Images, Videos & Audio)
Worked on audio data labeling projects supporting the training of speech recognition, natural language processing (NLP), and AI models. Used industry-standard annotation tools such as CVAT and Labelbox for managing and reviewing audio datasets. Performed audio transcription, classification, and segmentation tasks, including labeling speech, non-speech events, and speaker-related attributes. Annotated datasets for emotion recognition, speech-to-text, and audio classification models, ensuring accurate and consistent labeling. Followed strict project guidelines and quality standards, conducting thorough quality assurance checks to maintain high transcription accuracy and annotation consistency. Efficiently processed large volumes of audio data while meeting productivity targets and deadlines. Collaborated remotely with project teams to address feedback, refine annotation outputs, and continuously improve dataset quality for AI training purposes.