AI Evaluator
As an AI Evaluator, I assessed AI-generated responses primarily in business, economics, and behavioral finance contexts using detailed rubrics. I designed multimodal prompts, evaluated outputs for instruction-following and reasoning quality, and ensured high-quality ratings through evidence-based justifications. Stringent adherence to guidelines, identifying factual errors, and rating outputs were core aspects of this role. • Evaluated model outputs for accuracy, truthfulness, clarity, and logical consistency • Designed complex prompts requiring data interpretation and economic reasoning • Compared multiple AI-generated responses and rated their quality with justifications • Identified inaccuracies and logical inconsistencies, ensuring adherence to guidelines