AI Content Researcher/Generalist – RLHF (Reinforcement Learning from Human Feedback)
Contributed to AI training projects by evaluating and ranking model responses using Reinforcement Learning from Human Feedback. Ensured model accuracy and safety by identifying logical fallacies and factual inaccuracies in textual outputs. Applied prompt engineering and analytical writing skills to maximize LLM performance and relevance. • Evaluated and rated large language model responses. • Identified hallucinations and inaccuracies in output text. • Applied strict guidelines and rubrics for feedback accuracy. • Utilized technical reporting and fact-checking expertise in RLHF tasks.