Independent AI Evaluator
As an Independent AI Evaluator, I specialized in assessing AI model responses for clarity, safety, and instruction adherence. I performed pairwise ranking, conducted red teaming exercises, and audited chain-of-thought reasoning for logical consistency. My responsibilities included verifying AI-generated summaries for factual accuracy and providing culturally nuanced feedback for large language models.
• Executed pairwise ranking and judgment of AI outputs.
• Conducted comprehensive red teaming and bias detection exercises.
• Audited reasoning chains to promote logical and factual consistency.
• Verified the grounding of summaries against the given source text.