AI Trainer
As an AI Trainer at Outlier, I was responsible for training artificial intelligence models using Reinforcement Learning from Human Feedback (RLHF) techniques. My tasks included evaluating the credibility of model outputs and correcting computer code as part of the quality control process. This role demanded precise judgment and a deep understanding of AI systems. • Conducted RLHF-based training on text and code data. • Performed credibility assessments and factuality checks on model outputs. • Corrected and enhanced computer programming code as needed. • Collaborated with the AI development team to ensure quality and improvement.