Software Validation - AI Training Contractor
As an AI Training Contractor with DataAnnotation, I applied Reinforcement Learning from Human Feedback (RLHF) techniques to improve machine learning models. My work involved evaluating and prompting AI across multiple modalities such as text, image, audio, and computer code, ensuring adherence to safety and quality guidelines. I assessed outputs, engineered prompts, and conducted quality assurance for other analysts' work. • Utilized RLHF for text, image, audio, and code data. • Ensured legal and ethical compliance for AI outputs. • Performed prompt engineering and output assessment. • Conducted QA for peer submissions.