AI Content Specialist & Evaluator
My work here focuses on high-level AI model training and evaluation using text-based datasets. The scope involves Reinforcement Learning from Human Feedback (RLHF), where I perform Side-by-Side (SxS) comparisons to rank model responses based on helpfulness, tone, and factual integrity. My specific tasks include identifying and correcting model hallucinations, prompt engineering, and rewriting AI outputs to ensure a natural, human-like linguistic flow. I handle a consistent volume of complex tasks weekly, adhering to strict quality measures by passing frequent 'Gold Set' hidden tests and maintaining high-precision alignment with the project's core guidelines.