Remote AI Content Evaluator & Trainer
As an AI Content Evaluator & Trainer at Outlier AI, I evaluated and ranked AI-generated responses for accuracy, safety, and reasoning quality. I applied RLHF principles to improve large language models through targeted fine-tuning, and I used in-depth research and scientific verification to produce high-quality training datasets.
• Evaluated and ranked language model outputs
• Applied reinforcement learning from human feedback (RLHF) techniques
• Verified complex medical and scientific information
• Used proprietary tools for annotation and workflow management