Senior Reviewer
Served as a Senior Reviewer on a multimodal RLHF project through Outlier, evaluating AI-generated outputs across image and text inputs for accuracy, reasoning quality, and adherence to task guidelines. Responsibilities included identifying model failure modes such as hallucination and misalignment, rating and ranking responses to inform model improvement, and maintaining consistent quality standards across a generalist range of subject matter.