AI Trainer & Data Contributor
As an AI Trainer and Data Contributor for Mercor, and Scale AI, I completed a high volume of reinforcement learning from human feedback (RLHF) and evaluation tasks for frontier language model fine-tuning. My responsibilities included RLHF preference labeling, instruction-following evaluation, code correctness review, and creative-writing quality scoring. Additionally, I authored prompts across coding, math, and reasoning domains, performed factual verification, and conducted entity annotation using Labelbox software. • Completed over 500 AI training tasks emphasizing RLHF preference labeling and output evaluation. • Maintained a >95% quality acceptance rate and met strict deadlines for multiple concurrent annotation projects. • Utilized Labelbox, Outlier, Mercor, and Scale AI software for evaluation and annotation across diverse domains. • Specialized in preference labeling, factual verification, and creative writing quality scoring.