AI Training & Data Specialist
In my capacity as an AI Training & Data Specialist, I worked extensively on reinforcement learning from human feedback (RLHF) and large language model ranking. I focused on designing, evaluating, and validating prompt outputs to enhance AI dialogue quality and safety. My role required thorough review and fact-checking for reliable training data. • Created and rated prompts and responses for LLM evaluation. • Applied advanced fact-checking and validation procedures. • Used platforms including Outlier AI, Remotasks, and Fiver for distributed tasks. • Provided feedback on model outputs for continuous improvement.