English & Swahili Audio Transcription and Annotation – Appen & TranscribeMe
Worked on multiple projects transcribing and annotating English and Swahili speech datasets for training AI speech recognition and NLP systems. Tasks involved listening to short voice recordings, accurately transcribing content, removing disfluencies, correcting grammar, and classifying speaker accents and background noise levels. I followed detailed transcription and annotation guidelines to ensure consistency across large datasets. Quality control involved peer review and automated QA systems, with a maintained accuracy rate of over 98%. Some tasks included rating ASR model outputs for fluency and comprehension. Project size included processing over 1,000 audio clips weekly, totaling over 30,000 clips throughout the engagement.