AI Trainer/ Data Annotator
Conducted daily preference ranking of hundreds of multimodal AI outputs (images, audio, and video) against prompts to measure model performance and alignment. • Applied systematic comparative judgment techniques to evaluate relevance, quality, and coherence, ensuring consistent assessment across diverse modalities. • Maintained high accuracy and throughput in large-scale annotation workflows, delivering reliable training data to support model fine-tuning and preference optimization. • Collaborated with cross-functional AI teams to identify quality trends and provide actionable insights for improving generative model capabilities. • Leveraged specialized labeling platforms and strict evaluation guidelines to produce precise annotations, supporting reinforcement learning from human feedback (RLHF) processes.