LLM Evaluator & Search Rater
I evaluated AI outputs for search relevance and quality across UHRS, OneForma, and Appen platforms. Tasks included detailed rating of search engine responses using standardized rubrics. This work provided critical feedback for improving AI-powered search results. • Utilized platform-specific rating tools and interfaces • Applied consistent QA methods across hundreds of queries • Delivered high-quality relevance and intent assessments • Supported ongoing AI model enhancement and validation