Mandy Hathaway

AI ethics Specialist and AI Trainer

Remote, USA
$50.00/hr | Intermediate | AWS SageMaker | Appen | Labelbox

Key Skills

Software

AWS SageMaker
Appen
Labelbox
Mercor
Micro1
Mindrift
CrowdSource

Top Subject Matter

AI Ethics
Responsible AI
Conversational Design

Top Data Types

Text
Image
Document

Top Task Types

RLHF
Evaluation Rating
Prompt Response Writing SFT
Fine Tuning
Classification
Bounding Box
Object Detection
Text Summarization

Freelancer Overview

RLHF preference annotation dataset creator with 16+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include publishing datasets on Hugging Face. Education: Master of Arts (2021) and Bachelor of Arts (2017), both from Metropolitan State University. AI-training focus: text data and RLHF labeling workflows.

English (Intermediate)

Labeling Experience

RLHF Preference Annotation Dataset Creator

Text, RLHF
Led the creation and publication of an RLHF preference annotation dataset focused on AI ethics. Developed 95 prompts with 190 response pairs, detailed scoring criteria, and failure modes for each annotation. Contributed written justifications for dispreferred responses to improve AI model alignment.
• Published dataset on Hugging Face.
• Annotated across six ethics categories: refusal edge cases, sycophancy, parasocial attachment, anthropomorphism, bias, and dual-use harm.
• Used five scoring dimensions and explicit failure mode labeling.
• Built a DPO fine-tuning notebook using Colab and microsoft/phi-2 with LoRA adapters.
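As an illustration only, one annotated comparison of the kind described above might be represented as follows. This is a minimal sketch, not the published dataset's schema: every field name, the `PreferenceRecord` class, and the `to_dpo_row` helper are hypothetical, and the five scoring dimensions are left as unnamed placeholders because the profile does not list them. The `prompt`/`chosen`/`rejected` layout is the column convention commonly used for DPO preference data.

```python
from __future__ import annotations

from dataclasses import dataclass

# The six ethics categories named in the profile.
CATEGORIES = {
    "refusal edge cases",
    "sycophancy",
    "parasocial attachment",
    "anthropomorphism",
    "bias",
    "dual-use harm",
}


@dataclass
class PreferenceRecord:
    """One annotated comparison. All field names are hypothetical."""

    prompt: str
    chosen: str            # preferred response
    rejected: str          # dispreferred response
    category: str          # one of CATEGORIES
    scores: dict[str, int]  # five scoring dimensions (names are placeholders)
    failure_mode: str      # explicit failure mode label for the rejected response
    justification: str     # written rationale for the dispreferred response

    def __post_init__(self) -> None:
        # Reject records that fall outside the six categories or
        # do not carry exactly five dimension scores.
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category!r}")
        if len(self.scores) != 5:
            raise ValueError("expected exactly five scoring dimensions")

    def to_dpo_row(self) -> dict[str, str]:
        # Keep only the three columns a DPO-style trainer consumes.
        return {
            "prompt": self.prompt,
            "chosen": self.chosen,
            "rejected": self.rejected,
        }
```

Validating at construction time keeps malformed annotations out of the exported rows, while the scores, failure mode, and justification stay available for quality review.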

2024 - Present

Education

Metropolitan State University

Master of Arts, Ethical Technology and Artificial Intelligence

2018 - 2021

Metropolitan State University

Bachelor of Arts, Philosophy (Ethics and Language)

2014 - 2017

Work History

Mandy Hathaway Consulting

AI Ethics Consultant and Technical Writer

College Station
2026 - Present

Coding With Kids

Remote Coding Instructor and Team Lead

Remote
2021 - Present