AI Training Specialist | LLM Evaluator | AI Safety & Alignment Analyst
As an AI training and evaluation specialist, I conducted large language model (LLM) output assessments with a focus on factual accuracy and structured justifications. My work centered on evaluating LLM completions for reasoning quality, policy compliance, hallucination detection, and instruction adherence. I utilized annotation platforms and AI tools to support reinforcement learning from human feedback (RLHF) and multilingual evaluation tasks. • Assessed LLM outputs for factual correctness and edge cases. • Enforced guideline adherence and policy compliance in responses. • Generated comparative A/B rankings and detailed feedback. • Applied structured reasoning in English and Swahili evaluations.