For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
W

Wilson Wyatt

AI Mathematical Reasoning Evaluator

USA flagStanford, Usa
Entry Level

Key Skills

Software

No software listed

Top Subject Matter

Mathematics Domain Expertise
AI Reasoning
LLM Evaluation

Top Data Types

TextText
Computer Code ProgrammingComputer Code Programming

Top Task Types

No task types listed

Freelancer Overview

AI Mathematical Reasoning Evaluator. Brings 8+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Science, California Institute of Technology and Bachelor of Science, University of California, Los Angeles. AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

Entry Level

Labeling Experience

AI Mathematical Reasoning Evaluator

Text
Developed rigorous multi-step calculus and algebra problems to evaluate the reasoning capabilities of Large Language Models (LLMs). Applied structured scoring criteria to measure AI-generated solutions for accuracy, logical structure, and constraint alignment. Used Python libraries to validate outputs, detect errors, and document findings in detail. • Designed and evaluated mathematical test data specifically for AI assessment. • Implemented a comprehensive validation approach using NumPy and SciPy. • Defined and maintained detailed scoring rubrics for AI outputs. • Documented scientific edge cases and logical boundary conditions to enhance AI evaluation integrity.

Developed rigorous multi-step calculus and algebra problems to evaluate the reasoning capabilities of Large Language Models (LLMs). Applied structured scoring criteria to measure AI-generated solutions for accuracy, logical structure, and constraint alignment. Used Python libraries to validate outputs, detect errors, and document findings in detail. • Designed and evaluated mathematical test data specifically for AI assessment. • Implemented a comprehensive validation approach using NumPy and SciPy. • Defined and maintained detailed scoring rubrics for AI outputs. • Documented scientific edge cases and logical boundary conditions to enhance AI evaluation integrity.

2021 - Present

Education

U

University of California, Los Angeles

Bachelor of Science, Mathematics

Bachelor of Science
Not specified
C

California Institute of Technology

Master of Science, Applied Mathematics

Master of Science
Not specified

Work History

S

Stanford University

Research Mathematician

Stanford
2021 - Present
U

University of California, Berkeley

Mathematics Instructor / Graduate Researcher

Berkeley
2019 - 2021