LLM Preference Rater
Conducted pairwise ranking of LLM responses to STEM-related prompts, choosing the stronger response and explaining the preference decision in writing. Applied rubric-based standards for factual correctness, instruction adherence, clarity, and tone/style. Produced consistent evaluation judgments in a distributed, quality-sensitive annotation environment.