AI Data Partner
In this role, I evaluated large volumes of AI-generated outputs for correctness, coherence, and logical validity against stringent quality guidelines. I ranked multiple large language model (LLM) responses, identifying and documenting hallucinations, factual inaccuracies, and instruction violations. My structured, actionable feedback directly informed reinforcement learning workflows to enhance model performance. • Ensured high consistency and accuracy in all evaluation tasks. • Delivered feedback suitable for reinforcement learning with human feedback (RLHF) pipelines. • Adhered to remote work protocols and strict turnaround times. • Actively contributed to improving model instruction adherence and output quality.