AI Data Specialist (DataAnnotation)
As an AI Data Specialist at DataAnnotation, I worked with reinforcement learning from human feedback (RLHF) to improve machine learning model performance. My responsibilities included designing and evaluating prompts, ensuring model safety, and reviewing the work of other data labelers. I helped models meet ethical and legal requirements through direct human assessments.
• Performed RLHF tasks to improve LLM training and response accuracy.
• Performed quality assurance on labeled data for prompt, feedback, and instruction-following tasks.
• Enforced standards for model harmlessness, truthfulness, and safety.
• Produced assessment reports and provided detailed evaluations to support process improvements.