AI Training Specialist (Contract) | AI Training Platform – Remote
As an AI Training Specialist, I trained and fine-tuned large language models via RLHF evaluations for accuracy and safety. I assessed model outputs, identified hallucinations and policy violations, and provided structured feedback. My work focused on optimizing model performance, prompt engineering, and maintaining quality metrics. • Led RLHF evaluations on text-based LLM outputs for alignment and helpfulness. • Delivered structured, rubric-based feedback on logical consistency and safety. • Implemented prompt rewriting and response refinement for optimal outcomes. • Maintained high task throughput while ensuring top data quality.