AI Content Specialist
RLHF (Reinforcement Learning from Human Feedback): "Ranked and evaluated model-generated responses based on helpfulness, honesty, and harmlessness (HHH) criteria". Prompt Engineering: "Authored and refined complex prompts to test edge cases, safety boundaries, and reasoning capabilities of Large Language Models (LLMs)". Quality Assurance (QA): "Conducted rigorous fact-checking and rubric-based scoring to ensure model outputs aligned with strict project guidelines". Domain Expertise: "Provided high-level reasoning and expert-level solutions for [Math/Coding/Creative] tasks to improve model accuracy in specialized fields". Consistency: "Maintained a 95%+ accuracy rating across high-volume labeling tasks while strictly adhering to evolving project documentation".