AI Trainer & Data Annotator – Outlier AI (Remote)
As an AI Trainer & Data Annotator at Outlier AI, I labeled and evaluated AI-generated text responses for accuracy, reasoning, and safety improvements. I developed and refined prompts, created ideal responses for fine-tuning datasets, and ranked model outputs according to established evaluation metrics. My daily work emphasized rubric-based annotation, bias detection, and comprehensive quality assurance within a fast-paced remote environment. • Labeled AI-generated text responses for SFT and RLHF purposes. • Ranked responses based on coherence, correctness, and policy compliance. • Identified hallucinations, bias, and logical inconsistencies in model outputs. • Provided structured annotation feedback to align with detailed rubrics.