AI Content Specialist (LLM RLHF Trainer)
As an AI Content Specialist at Outlier.ai, I trained large language models using reinforcement learning from human feedback (RLHF). My responsibilities included crafting strategic prompts and evaluating model outputs to enhance model performance. I focused specifically on reducing hallucinations and improving the coherence of generated responses. • Developed and applied prompt-based evaluation criteria. • Analyzed and corrected model-generated text for factual accuracy. • Collaborated with team members to refine LLM fine-tuning processes. • Contributed to research on prompt engineering techniques.