AI Training Specialist (Freelance) | Outlier & Shaip
As an AI Training Specialist at Outlier & Shaip, I evaluated and optimized outputs generated by AI language models using reinforcement learning from human feedback (RLHF). I performed technical data labeling and annotation, specifically correcting and validating programming code, and critically analyzed complex prompts to enhance the accuracy and safety of language models.
• Assessed and improved AI model responses using RLHF methodologies.
• Labeled and annotated datasets by reviewing and correcting technical programming code.
• Enhanced prompt design and evaluation for complex AI use cases.
• Contributed to the validation and safety processes of large language models.