AI Data Labeler/Evaluator (Short-Term)
In this short-term contract, I evaluated AI outputs, identified errors, and provided qualitative feedback. The work focused on logical correctness, style adherence, and ranking responses by quality. My detailed annotations and feedback helped improve the model's reasoning and consistency.
• Analyzed chain-of-thought outputs to pinpoint logic errors.
• Scored AI-generated responses against project standards.
• Collaborated on defining style and response personas.
• Documented recurring hallucination patterns to support their reduction.