Large-Scale Multimodal AI Training Data Curation
Led end-to-end annotation across 1,500+ hours of video, image, audio, and text datasets for production LLMs and computer vision models at TELUS International and RWS Moravia. Specialized in complex video annotation including temporal segmentation, object tracking, action recognition, and scene classification with 98%+ accuracy. Conducted extensive LLM evaluation work—refining prompts, assessing model reasoning quality, and testing contextual accuracy across thousands of responses. Performed quality assurance and peer review identifying systematic errors that improved project accuracy by 15-20%. Worked on multilingual projects including 500+ hours of Swahili audio transcription, applying linguistic expertise to low-resource language datasets. Collaborated directly with ML engineering teams via GitHub and Slack to optimize annotation schemas, debug pipeline issues, and ensure dataset quality standards aligned with model perform