AI Trainer & Data Annotator
Worked as an AI Trainer & Data Annotator evaluating large language model outputs for quality and safety. Performed annotation and preference ranking on tasks such as question answering, summarisation, and instruction following. Contributed to reinforcement learning pipelines by labelling AI-generated responses and flagging policy-violating content. • Ensured consistently high annotation quality by adhering to detailed labelling guidelines. • Ranked and classified responses based on accuracy, helpfulness, safety, and tone. • Flagged harmful or biased outputs to support safety reviews. • Provided human feedback for reinforcement learning (RLHF) systems.