Freelance Contributor (AI Training & RLHF)
As a Freelance Contributor for Data Annotation Tech, I evaluated and rated AI-generated outputs for large language models. My responsibilities included reinforcement learning from human feedback (RLHF) and technical verification. This role required applying formal logic and scientific knowledge for chatbot and LLM improvement. • Judged logic, safety, and factual accuracy in AI responses. • Audited technical, scientific, and mathematical outputs for correctness. • Enhanced model performance by identifying nuances in intent and reasoning. • Used Data Annotation Tech’s platform for managing and submitting reviews.