Generalist AI Expert — Model Response Evaluation (MRN)
As a Generalist AI Expert at Mercor, I evaluated language model responses using structured rubrics and provided detailed preference annotations. I fact-checked model outputs across diverse topics and ensured logical reasoning in AI-generated text. I documented selection rationale and maintained rigorous quality standards throughout the annotation process. • Assessed LLM responses using a 6-dimension evaluation rubric. • Produced preference annotations for reward model training. • Performed multi-step fact-checking and logical verification. • Provided thorough written justifications for annotation decisions.