For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
D

Dunn Judson

AI Evaluation Specialist

Kenya flagRemote, Kenya
$40.00/hrExpertData Annotation TechMercorMicro1

Key Skills

Software

Data Annotation TechData Annotation Tech
MercorMercor
Micro1
MindriftMindrift
OneFormaOneForma
RemotasksRemotasks
Scale AIScale AI

Top Subject Matter

Large Language Models
General Knowledge
Fact Verification

Top Data Types

TextText
ImageImage
VideoVideo

Top Task Types

Bounding BoxBounding Box
SegmentationSegmentation
ClassificationClassification
Point/Key PointPoint/Key Point
TranscriptionTranscription

Freelancer Overview

AI Evaluation Specialist. Brings 5+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Arts, University of California, Berkeley (2017). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

ExpertEnglish

Labeling Experience

AI Evaluation Specialist

Text
As an AI Evaluation Specialist, I systematically evaluated large language model (LLM) responses for clarity, reasoning, and factual correctness. My work involved fact-checking model-generated content, providing detailed feedback, and collaborating with AI development teams to improve model behaviors. Using established evaluation rubrics and taxonomies, I ensured rigorous and consistent assessment standards were met. • Assessed LLM output quality across a broad array of topics and questions. • Conducted detailed fact-checking using academic and reputable sources. • Delivered structured, actionable feedback to model engineers for iterative improvement. • Ensured reproducibility and accuracy of AI evaluation processes through benchmark adherence.

As an AI Evaluation Specialist, I systematically evaluated large language model (LLM) responses for clarity, reasoning, and factual correctness. My work involved fact-checking model-generated content, providing detailed feedback, and collaborating with AI development teams to improve model behaviors. Using established evaluation rubrics and taxonomies, I ensured rigorous and consistent assessment standards were met. • Assessed LLM output quality across a broad array of topics and questions. • Conducted detailed fact-checking using academic and reputable sources. • Delivered structured, actionable feedback to model engineers for iterative improvement. • Ensured reproducibility and accuracy of AI evaluation processes through benchmark adherence.

2019 - Present

Education

U

University of California, Berkeley

Bachelor of Arts, English Literature

Bachelor of Arts
2013 - 2017

Work History

T

TechNow Media

Content Strategist & Writer

Remote
2018 - 2021
D

Data Insights Group

Research Analyst

Los Angeles
2017 - 2019