For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A

Amie Twyford

Independent Applied AI Researcher – Multi-Agent LLM Evaluation/Guardrail Testing

USA flag
New York, Usa
$45.00/hrExpertMercorInternal Proprietary ToolingOther

Key Skills

Software

MercorMercor
Internal/Proprietary Tooling
Other
LabelboxLabelbox
Scale AIScale AI
CVATCVAT
Label StudioLabel Studio

Top Subject Matter

LLM Evaluation
AI Security
Agentic Workflow

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
DocumentDocument

Top Task Types

RLHF
Transcription
Prompt Response Writing SFT

Freelancer Overview

Independent Applied AI Researcher – Multi-Agent LLM Evaluation/Guardrail Testing. Brings 10+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Labelbox. Education includes Certificate, Stanford University (2025). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

ExpertEnglish

Labeling Experience

Labelbox

Independent Applied AI Researcher – Multi-Agent LLM Evaluation/Guardrail Testing

LabelboxText
Designed and tested multi-agent LLM evaluation frameworks for AI output reliability. Developed automated adversarial guardrail and prompt-injection stress testing procedures for agentic workflows. Created structured criteria for judging LLM responses in credibility models. • Designed evaluation metrics to assess LLM reliability for complex outputs • Automated adversarial attacks and guardrail tests for AI safety • Utilized tools like LangSmith and Labelbox for structured monitoring • Documented and analyzed results to guide AI security enhancements

Designed and tested multi-agent LLM evaluation frameworks for AI output reliability. Developed automated adversarial guardrail and prompt-injection stress testing procedures for agentic workflows. Created structured criteria for judging LLM responses in credibility models. • Designed evaluation metrics to assess LLM reliability for complex outputs • Automated adversarial attacks and guardrail tests for AI safety • Utilized tools like LangSmith and Labelbox for structured monitoring • Documented and analyzed results to guide AI security enhancements

2023 - Present

Education

M

MIT

Certificate, Emerging Technologies

Certificate
2025 - 2025
S

Stanford University

Certificate, Applied Artificial Intelligence, Data Analysis, Customer Experience

Certificate
2025

Work History

E

Elevation Marketing

Senior Manager, Partnerships

New York
2021 - Present
D

DDR Media

Director of Data Monetization

New York
2020 - 2021