AI Trainer / RLHF Specialist / AI Data Annotator
As an AI Trainer and RLHF Specialist, I evaluated large language model outputs and provided feedback to improve them. I reviewed model answers for accuracy, reasoning quality, instruction-following, and contextual correctness, and distinguished responses that were correct in context from those that were only superficially correct.
• Evaluated model outputs across natural language processing, code generation, and reasoning tasks
• Judged responses for alignment and factual quality
• Provided reinforcement learning signals through precise human feedback
• Participated in prompt engineering and instruction-following assessments