Generalist
• Performed daily preference ranking of hundreds of multimodal AI outputs (images, audio, and video) against prompts to evaluate model performance and alignment.
• Applied systematic comparative judgment techniques to assess relevance, quality, and coherence of generative model outputs.
• Utilized labeling platforms and guidelines to ensure precise annotation.