Speech-to-Text (ASR) Transcription & Audio Annotation for AI Model Training
Contributed to large-scale Automatic Speech Recognition (ASR) and NLP training projects involving high-accuracy transcription, speaker labeling, and audio segmentation. Key Responsibilities: Transcribed and timestamped 2,000+ hours of audio data across diverse accents and acoustic environments. Performed speaker diarization and labeled multi-speaker conversations with high precision. Cleaned and normalized transcripts to meet strict formatting and linguistic standards. Flagged unclear audio segments and applied structured noise-labeling protocols. Evaluated AI-generated transcripts and rated accuracy for model refinement. Assisted in emotion tagging and sentiment labeling for conversational AI datasets. Maintained 98%+ transcription accuracy based on QA audits. Quality Measures Followed: Adherence to verbatim and clean-verbatim transcription standards Timestamp consistency and alignment validation Multi-stage review and correction workflow Consistent terminology normalizat