AI Training Specialist / Data Specialist (Contract)
I specialized in Reinforcement Learning from Human Feedback (RLHF) to assess and improve AI-generated responses for accuracy and natural conversational flow. I engineered diverse prompts to evaluate LLM performance, focusing on logic, creativity, and technical compliance. My role also included rigorous fact-checking and truthfulness verification to mitigate erroneous outputs.
• Ranked and refined AI responses using RLHF methodologies
• Designed and deployed complex LLM prompts for performance testing
• Verified AI outputs against primary sources to reduce hallucination risks
• Specialized in factual integrity and conversation evaluation