Subject Matter Expert & AI Trainer - Valkyrie
Served as a Subject Matter Expert on the Outlier Valkyrie project, specializing in Reinforcement Learning from Human Feedback (RLHF) to evaluate and improve Large Language Models (LLMs). I leveraged my domain expertise to design highly complex, real-world prompts focused on clinical dermatology, nutritional science, and advanced mathematics. My core responsibilities included 'model stumping' to test AI limits in these specialized fields, developing precise grading rubrics, and critically analyzing the accuracy and safety of AI-generated responses. Through rigorous human feedback, I directly contributed to enhancing the models' clinical reasoning, mathematical accuracy, and overall alignment.