AI Data Annotation & Response Evaluation (RLHF)
I performed response evaluation and ranking tasks for AI model training, focusing on reinforcement learning from human feedback. My work required consistently high precision, subjective content judgment, and structured decision-making. I maintained content quality by enforcing guidelines and escalating edge cases during quality control reviews.
• Performed RLHF-based text evaluation and ranking of model responses
• Conducted quality control checks to ensure annotation consistency
• Utilized structured workflows and feedback loops for ongoing improvements
• Enhanced platform communication quality through ongoing moderation