Math Reasoning Evaluator
The project involved designing original, challenging problems in university-level mathematics (calculus, algebra, and statistics) and in competition mathematics, aimed at testing and extending an AI model's problem-solving capabilities. As a data labeler, I was responsible for 1) evaluating the accuracy and validity of the model's solutions by checking each step of its reasoning and calculations, and 2) revising the model's solutions to ensure correctness and clarity. This work identified gaps in the model's understanding and helped refine its ability to solve diverse mathematical problems.