Rubric project
Contributors assessed prompt-response pairs for accuracy, reasoning quality, clarity, and adherence to complex guidelines. Beyond applying evaluation criteria, they developed and refined the detailed rubrics used to assess model outputs systematically. The work involved identifying factual errors, logical gaps, and instruction-following issues, and providing comparative rankings and actionable feedback to improve model performance.