AI Specialist - Data Annotation and RLHF Contributor (Freelance, Remote)
I performed data annotation and feedback rating on large language model (LLM) outputs to ensure they met strict quality standards. I applied mathematics, statistics, and IT knowledge to provide helpful, accurate training signals that improve model performance through Reinforcement Learning from Human Feedback (RLHF). I contributed to 38 generative AI projects, providing feedback that informs the training of these models.
• Data annotation and evaluation of LLM outputs
• Application of domain expertise to mathematical and technical content
• Execution of RLHF workflows for supervised model improvement
• Maintenance of quality benchmarks across multiple AI projects