Kaleb Jefferson

AI Specialist - Remote (Data Labeling, RLHF, Red Teaming, Evaluation)

Cambridge, USA
Expert, Other

Key Skills

Software

Other

Top Subject Matter

AI Alignment
Applied Ethics
Model Safety

Top Data Types

Text
Image
Document

Top Task Types

RLHF

Freelancer Overview

AI Specialist - Remote (Data Labeling, RLHF, Red Teaming, Evaluation) with 5+ years of professional experience spanning research and quality-focused execution. Education: Bachelor of Arts, Waynesburg University (2016), and High School Diploma, Colonel Richardson High School (2016). AI-training focus: Text data and RLHF labeling workflows.

Expert

Labeling Experience

AI Specialist - Remote (Data Labeling, RLHF, Red Teaming, Evaluation)

Other, Text, RLHF
I developed and deployed over 1,000 complex prompts across various professional domains to assess model alignment using RLHF and A/B testing processes. My work involved structured evaluation of model outputs, identifying failures against the Helpful, Harmless, and Honest standards, and composing detailed performance reports. I conducted over 800 peer reviews to uphold model fidelity and created prompt injection attack vectors to reveal vulnerabilities in safety guardrails.
• Designed, reviewed, and rated a high volume of prompts and model responses using RLHF pipelines.
• Performed peer review of AI model outputs and flagged inconsistent or low-quality samples in annotated datasets.
• Engineered adversarial prompts to test policy adherence and expose safety weaknesses in LLMs.
• Provided structured, asynchronous feedback to project leads on rubric clarity, annotation reporting, and category alignment.

2023 - Present

Education

Colonel Richardson High School

High School Diploma, General Education

2012 - 2016

Waynesburg University

Bachelor of Arts, English Literature, Philosophy

2016

Work History

Walmart

Freezer Associate

Cambridge
2021 - 2023

Starbucks

Barista Trainer

Cambridge
2019 - 2021