AI Mathematical Reasoning Evaluator
Developed rigorous multi-step calculus and algebra problems to evaluate the reasoning capabilities of Large Language Models (LLMs). Applied structured scoring criteria to assess AI-generated solutions for accuracy, logical structure, and constraint alignment. Used Python libraries to validate outputs, detect errors, and document findings in detail.
• Designed and evaluated mathematical test data tailored to AI assessment.
• Implemented a numerical validation workflow using NumPy and SciPy.
• Defined and maintained detailed scoring rubrics for AI outputs.
• Documented edge cases and logical boundary conditions to strengthen evaluation integrity.
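The NumPy/SciPy validation step described above could be sketched as follows. This is a minimal illustration, not the actual evaluation code: the specific problem (an antiderivative check) and the tolerance are assumptions chosen for the example.

```python
import numpy as np
from scipy import integrate

# Hypothetical validation case: an AI model claims that
# F(x) = x*sin(x) + cos(x) is an antiderivative of f(x) = x*cos(x).
def f(x):
    return x * np.cos(x)

def F(x):
    return x * np.sin(x) + np.cos(x)

# Check the claim numerically on [0, 2]: the integral of f
# should match F(2) - F(0) by the fundamental theorem of calculus.
numeric, _abs_err = integrate.quad(f, 0.0, 2.0)
analytic = F(2.0) - F(0.0)

# Flag the AI solution as incorrect if the values disagree
# beyond a small tolerance (assumed here to be 1e-8).
assert abs(numeric - analytic) < 1e-8, "AI solution fails numerical check"
```

A check like this catches sign errors and dropped terms in symbolic answers without requiring a full symbolic-algebra comparison.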