AI Evaluation Specialist / LLM Analyst
As an AI Evaluation Specialist and LLM Analyst, I assessed large volumes of language-model-generated responses for alignment with user intent, factual accuracy, and reasoning quality. I provided structured, comprehensive annotations that highlighted both the strengths and the failures in the model's outputs, fact-checked responses systematically, and held them to defined conversational standards.
• Conducted detailed QA of LLM responses against public, authoritative sources
• Generated high-signal feedback for continual model improvement
• Focused on reasoning depth, tone, logical flow, and completeness
• Enabled actionable improvements by annotating response quality and issues