AI Content Evaluator & RLHF Specialist
I collaborate with Outlier on high-level RLHF projects, where I evaluate and rank AI-generated responses against strict criteria: accuracy, safety, and helpfulness. My role involves analyzing complex prompts, fact-checking responses, and writing a detailed justification for each assessment of the model's performance to support the refinement of Large Language Models (LLMs).