Independent Consultant for LLM evaluation (Uber, Midlocalize.com)
As an independent consultant, I evaluated and reviewed language model responses for AI projects. This included analyzing AI-generated outputs and providing structured, actionable feedback to improve the quality, accuracy, and cultural relevance of model responses.
• Analyzed model outputs for linguistic and contextual appropriateness
• Provided structured feedback to improve AI responses
• Served as an LLM evaluator and cultural studies consultant
• Ensured models met user satisfaction and cultural relevance requirements