For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
H
Heather Sullivan

Heather Sullivan

AI Response Evaluator

United Kingdom flagManchester, United Kingdom
$35.00/hrExpertAppenLabelbox

Key Skills

Software

AppenAppen
LabelboxLabelbox

Top Subject Matter

AI Language Model Evaluation
General AI/ML Data Labeling

Top Data Types

TextText
ImageImage
VideoVideo

Top Task Types

ClassificationClassification
Object DetectionObject Detection
Text GenerationText Generation
Question AnsweringQuestion Answering
Fine-tuningFine-tuning
TranscriptionTranscription
Data CollectionData Collection
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)

Freelancer Overview

AI Response Evaluator. Brings 9+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Appen. Education includes Bachelor of Science, Open University (2018). AI-training focus includes data types such as Text and labeling workflows including Evaluation, Rating, and Classification.

ExpertEnglish

Labeling Experience

AI Response Evaluator

Text
In this role, I evaluated AI-generated responses in multi-turn conversations using structured rubrics. My focus was on instruction adherence, logical reasoning, factual accuracy, and coherence in the outputs. I provided detailed feedback to improve model performance and identified edge cases or failure patterns. • Applied complex, evolving guidelines with high accuracy • Assessed language model consistency and reliability • Delivered clear, comprehensive evaluations on large data batches • Supported iterative improvements for AI conversational systems

In this role, I evaluated AI-generated responses in multi-turn conversations using structured rubrics. My focus was on instruction adherence, logical reasoning, factual accuracy, and coherence in the outputs. I provided detailed feedback to improve model performance and identified edge cases or failure patterns. • Applied complex, evolving guidelines with high accuracy • Assessed language model consistency and reliability • Delivered clear, comprehensive evaluations on large data batches • Supported iterative improvements for AI conversational systems

2024 - Present
Appen

AI Training Contributor

AppenTextClassification
During my time as an AI Training Contributor at Appen, I worked on long-term projects involving annotation and categorization of various datasets. Tasks included labeling and organizing text, audio, and image data to train machine learning models. I maintained high accuracy and adaptability across multiple, frequently updated projects. • Completed large-scale repetitive dataset labeling • Followed evolving guidelines and quality requirements • Contributed to search relevance and AI model performance • Demonstrated reliability over a sustained period

During my time as an AI Training Contributor at Appen, I worked on long-term projects involving annotation and categorization of various datasets. Tasks included labeling and organizing text, audio, and image data to train machine learning models. I maintained high accuracy and adaptability across multiple, frequently updated projects. • Completed large-scale repetitive dataset labeling • Followed evolving guidelines and quality requirements • Contributed to search relevance and AI model performance • Demonstrated reliability over a sustained period

2014 - 2023

Education

O

Open University

Bachelor of Science, Earth Science

Bachelor of Science
2018 - 2018

Work History

F

Freelance

Actor (Screen & TV)

Manchester
2018 - Present