AI Trainer / Data Annotator
As an AI Trainer and Data Annotator at Outlier (Scale AI Platform), I annotated and reviewed diverse datasets to train and fine-tune large language models. My work included providing reinforcement learning from human feedback (RLHF) and performing instruction and creative writing evaluations. I maintained a high standard of quality and accuracy, adhering strictly to platform and team protocols. • Labeled and reviewed text, code, and conversation datasets for LLM training. • Provided RLHF by ranking and rating AI model outputs for safety and helpfulness. • Conducted prompt engineering and instruction following evaluation tasks to enhance model performance. • Consistently maintained output accuracy above platform thresholds while collaborating asynchronously with global teams.