AI Training Data Generalist – RLHF & Annotation Specialist
Contributed as an AI training data generalist on platforms such as Outlier and Mercor, supporting the development and optimization of large language models. Performed high-quality annotation and evaluation tasks including text classification, response ranking, and prompt-response assessment based on detailed guidelines. Played a key role in RLHF workflows by evaluating model outputs for accuracy, coherence, relevance, and safety, while providing clear justifications for decisions. Maintained consistency across diverse task types, identified edge cases, and adapted quickly to evolving project requirements, helping improve overall model performance and reliability.