AI/LLM Evaluation Contributor | Freelance
As an AI/LLM Evaluation Contributor at Mindrift, I analyzed AI-generated responses to provide rankings and constructive feedback. My work focused on improving model reasoning, factual correctness, and alignment with user intent. I participated in projects involving code generation and instruction-following for SQL tasks. • Analyzed AI outputs in both multi-turn and web-assisted settings • Provided comparative rankings to guide model improvements • Collaborated on evaluation projects with a SQL emphasis • Delivered justifications to enhance LLM understanding and performance