Coffee by Appen
The Coffee project was a large-scale Appen initiative focused on annotating naturally occurring human conversations to improve AI voice assistants and NLP systems. Working within the ADAP platform, I performed verbatim transcription capturing all stutters, fillers, and disfluencies, applied structured tags for non-speech sounds, conducted multi-speaker identification and turn-taking annotation, marked overlapping speech, and labeled emotional tone based on vocal delivery. The project was high-volume, involving hundreds of audio segments processed through a comprehensive multi-step annotation workflow. Quality was upheld by listening to each full segment before annotating, using playback tools to resolve unclear speech, maintaining strict consistency in tagging and transcription conventions, and following Appen's versioned internal guidelines throughout — ensuring all output met the standards required for model-ready AI training data.