Search Quality & Instruction-Following Evaluation for LLMs
I participated in an evaluation project aimed at improving LLM instruction-following and search-related reasoning. My tasks involved reviewing model outputs for relevance, logical structure, and alignment with user intent. I also reviewed long-form answers and checked them for compliance with content and safety standards. The work required analytical judgment, close attention to detail, and consistent application of the evaluation criteria.