Delivery Manager
– Started as a trainer for Codeforces-style competitive programming tasks, evaluating contributor solutions and guiding improvements in algorithmic reasoning and code quality. – Worked on the OSWorld RLHF dataset and later managed RL data operations for a team of 20 contributors focused on reinforcement learning training data. – Led development of CUA and SFT datasets used for training large language models while ensuring annotation quality and guideline compliance. – Promoted to Delivery Manager managing 250+ contributors and 10 POD Leads while coordinating dataset production for enterprise clients including Alibaba. – Designed task distribution pipelines, review workflows, and quality control systems to maintain high quality LLM training data.