Senior Data Annotator & RLHF Specialist
This role involved leading annotation projects for LLM fine-tuning and reinforcement learning from human feedback. Daily responsibilities included evaluating model-generated text responses for attributes like helpfulness, harmlessness, factuality, and instructions adherence. High-volume text data was annotated for diverse NLP tasks such as coreference resolution and toxicity detection. • Rated and ranked LLM outputs • Performed entity and relationship classification • Wrote and rewrote model prompts • Maintained a personal quality score of 98.2%