Speech Data Annotation & Whisper Fine-Tuning (Nigerian Accent Adaptation)
This project focused on improving automatic speech recognition (ASR) performance for Nigerian-accented English by preparing and validating high-quality speech datasets for fine-tuning a Whisper-based transcription model. Responsibilities included curating and cleaning audio samples containing diverse Nigerian accents, performing accurate manual transcription, and aligning transcripts with corresponding audio segments. I normalized text outputs to maintain consistent spelling standards while preserving accent-specific linguistic characteristics. Special attention was given to code-switching patterns (English mixed with local expressions), pronunciation variations, and phonetic inconsistencies common in regional speech.