AI Trainer (Freelance) | Outlier AI
As an AI Trainer at Outlier AI, I annotated more than 500 text samples per week to build and refine datasets for large language models. I authored prompts for coding and reasoning tasks and provided RLHF feedback to reduce hallucinations and bias in model responses. I also evaluated model outputs for correctness and for compliance with strict labeling guidelines.
• Conducted high-volume weekly data annotation for LLM training.
• Designed and reviewed hundreds of prompts spanning technical and reasoning domains.
• Provided RLHF-based feedback to fine-tune model outputs, noticeably reducing error rates.
• Maintained dataset quality and labeling integrity through guideline enforcement.