Data annotation
AI Data Labeling Specialist at Mindrift (Jan 2024 - Present), specializing in RHLF (Reinforcement Learning from Human Feedback) and LLM fine-tuning datasets. Created 10,000+ high-quality prompt-response pairs using RHLF methodology for LLM alignment, including preference annotations and reward modeling. Performed supervised fine-tuning (SFT) data labeling with classification, entity recognition (NER), and evaluation/rating tasks across healthcare and robotics domains. Used Mindrift platform for scalable annotation workflows, achieving 97% inter-annotator agreement on complex reasoning and safety-critical LLM training data.