French Audio Transcription & Speech Data Annotation for AI Training
Worked on AI training projects involving French speech and audio data, producing clean, standardized transcripts that accurately reflect the spoken content, without translation or paraphrasing. Tasks included listening to short audio clips, applying strict transcription rules, and ensuring correct French spelling, accents, punctuation, and formatting. Performed listening-based verification and labeling, including flagging unclear, noisy, or unusable audio segments using the platform's predefined options. Kept annotations consistent, prioritized accuracy over speed, and followed detailed guidelines to produce reliable training data for speech recognition and language models. Quality measures included self-review, adherence to annotation rubrics, and applying the same judgment criteria whenever audio was imperfect or ambiguous.