AI Evaluator and Generative AI Specialist
As an AI Evaluator and Generative AI Specialist, I evaluated LLM model outputs for accuracy, safety, clarity, and hallucination risk. I performed structured annotation, data quality assessment, guideline-based evaluation, and prompt testing for generative AI models. Work consisted of detailed model evaluation, prompt engineering, red-teaming, and scenario-based analysis. • Evaluated model outputs for safety, accuracy, and clarity • Performed hallucination detection and prompt optimization • Conducted red-teaming, bias detection, and edge-case analysis • Used structured annotation under strict guidelines