Multilingual Speech-to-Text Audio Annotation for Voice AI Training
As part of an AI voice model training initiative with Scale AI, I worked on a large-scale audio annotation project involving diverse English and Spanish datasets. My role included segmenting long-form speech, labeling speaker changes, correcting transcription errors, tagging environmental sounds, and aligning timestamps with high precision. I adhered to strict quality benchmarks and met aggressive delivery schedules to ensure the data met training-ready standards.