Software Engineer | ML/DL Engineer (RLHF Expert)
As an RLHF expert, I contributed directly to improving the quality and reliability of production-scale LLM systems. My work involved analyzing model outputs, identifying edge cases, and applying reinforcement learning from human feedback to enhance model performance. I maintained high-quality evaluation standards across multiple advanced LLM features. • Led evaluation of LLM model outputs, focusing on accuracy and reliability • Enhanced robustness of LLM features such as Advanced Data Analysis and Browser • Mentored and onboarded a team to uphold consistent evaluation standards • Collaborated to refine and align model behavior for real-world use cases