For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Stephanie Spencer

Stephanie Spencer

AI Evaluation Specialist - Conversational AI

USA flag
washington, Usa
ExpertCVAT

Key Skills

Software

CVATCVAT

Top Subject Matter

No subject matter listed

Top Data Types

No data types listed

Top Label Types

No label types listed

Freelancer Overview

I am an experienced AI evaluation specialist with a strong background in software engineering and hands-on expertise in data annotation and LLM quality analysis. My work involves meticulously assessing large volumes of model-generated responses for factual accuracy, reasoning, tone, and adherence to guidelines, ensuring AI outputs are trustworthy and user-aligned. I excel at producing detailed, structured feedback and annotations that drive measurable improvements in NLP model performance. My analytical mindset, combined with daily interaction with modern language models and a solid foundation in computer science, enables me to identify subtle errors and provide actionable insights for model refinement across a wide range of domains.

Expert

Labeling Experience

AI Evaluation Specialist / LLM Analyst

TextEvaluation Rating
As an AI Evaluation Specialist and LLM Analyst, I assessed large volumes of language model-generated responses for alignment with user intent, factual accuracy, and reasoning. I provided structured, comprehensive annotations to highlight both strengths and failures in the model's outputs. Systematic fact-checking and adherence to defined conversational standards were always ensured. • Conducted detailed QA of LLM responses using public authoritative sources • Generated high-signal feedback for continual model improvement • Focused on reasoning depth, tone, logical flow, and completeness • Enabled actionable improvements by annotating response quality and issues

As an AI Evaluation Specialist and LLM Analyst, I assessed large volumes of language model-generated responses for alignment with user intent, factual accuracy, and reasoning. I provided structured, comprehensive annotations to highlight both strengths and failures in the model's outputs. Systematic fact-checking and adherence to defined conversational standards were always ensured. • Conducted detailed QA of LLM responses using public authoritative sources • Generated high-signal feedback for continual model improvement • Focused on reasoning depth, tone, logical flow, and completeness • Enabled actionable improvements by annotating response quality and issues

2023
CVAT

Data Annotator

CVATImageBounding BoxPolygon
working across such diverse annotation projects has strengthened my adaptability and sharpened my technical judgment. It has given me a comprehensive understanding of how data labeling varies across contexts—and how critical human precision is in building reliable AI systems.

working across such diverse annotation projects has strengthened my adaptability and sharpened my technical judgment. It has given me a comprehensive understanding of how data labeling varies across contexts—and how critical human precision is in building reliable AI systems.

2022 - 2025

Education

U

UC Berkeley

Bachelor of Science, Computer Science

Bachelor of Science
2014 - 2018

Work History

V

Various Projects

Software Engineer / Technical Analyst

Remote
2020 - 2023