Machine Learning Engineer – RLHF Specialist
I integrated RLHF-style structured human feedback processes into LLM pipeline development. I led evaluators while ensuring annotation standards and applied human feedback to improve AI alignment. I implemented and improved data curation workflows for structured model evaluation procedures. • Designed and executed RAG pipelines leveraging RLHF feedback for factual accuracy. • Trained annotation teams to deliver consistent evaluative input on LLM outputs. • Developed human feedback loops for hallucination and bias identification. • Supported enterprise clients with state-of-the-art AI retrieval and labeling workflows.