Business Analyst (AI Model Evaluation)
I evaluated and compared AI model outputs using structured metrics to improve model performance. My work involved identifying hallucinations, bias, and logical inconsistencies in generated text. I refined inputs and provided detailed written justifications for model ratings.
• Conducted structured comparisons between AI models on diverse text prompts and scenarios
• Assessed outputs for accuracy, coherence, instruction adherence, and reasoning
• Optimized prompt structures and refined input data for better responses
• Produced clear written justifications to support model evaluations