Expert STEM Data Annotation
I serve as a specialized consultant for high-priority AI training projects, focusing on the curation and validation of complex STEM datasets. My work involves authoring high-quality prompt-response pairs for instruction-tuning and performing rigorous RLHF (Reinforcement Learning from Human Feedback) to improve model accuracy and helpfulness. I have curated over 10,000 high-quality data entries, specifically targeting advanced topics in software development, physics, and mathematics. Additionally, I execute 'red teaming' scenarios to identify and mitigate model hallucinations, logical inconsistencies, and safety risks.