Jason Bieber

AI Training Data Evaluator

Calgary, Canada
$15.00/hr
Intermediate
Data Annotation Tech

Key Skills

Software

Data Annotation Tech

Top Subject Matter

Information Technology / Technical Support
Generative AI & Conversational Systems
Gaming & Consumer Technology

Top Data Types

Image
Document
Text

Top Task Types

Classification
Text Summarization
RLHF
Evaluation/Rating
Prompt + Response Writing (SFT)
Object Detection
Fine-tuning

Freelancer Overview

Experienced AI training and data annotation contributor with hands-on work evaluating and improving large language model outputs across a wide range of tasks, including instruction following, conversational quality, reasoning accuracy, summarization, content safety, and creative generation. Skilled in comparing model responses against detailed rubrics, identifying factual inconsistencies, rewriting low-quality outputs, and producing high-quality “gold standard” responses designed to improve model performance and reliability. Comfortable working independently in remote, fast-paced environments that require strong attention to detail, consistency, and analytical thinking.

What sets me apart is the ability to balance technical precision with genuine human perspective. I don’t just evaluate whether an AI response is “correct”; I look at whether it actually feels helpful, natural, trustworthy, and aligned with real human expectations. My background in both information technology and customer-facing environments has given me strong communication skills, patience, adaptability, and the ability to interpret nuance beyond rigid scoring guidelines. I take pride in producing thoughtful, consistent work that improves the overall quality of AI interactions rather than simply completing tasks as quickly as possible.

English (Intermediate)

Labeling Experience

Multimodal AI Evaluation

Image, Object Detection
Worked on human preference ranking workflows for multimodal generative AI systems, evaluating image and video outputs across dimensions such as prompt alignment, aesthetic quality, consistency, artifact detection, and content policy adherence. Contributed detailed comparative evaluations used to refine model performance and training datasets.

2024 - 2024

Annotation Rubric Development

Document, Fine-tuning
Contributed to annotation rubric development projects for generative AI evaluation workflows, helping define structured scoring criteria for response quality, reasoning, instruction adherence, factual accuracy, tone, and safety. Worked with complex guidelines to ensure evaluations remained consistent, objective, and aligned across diverse prompt categories and conversational scenarios.

2024 - 2024

Golden Response Authoring

Text, Text Summarization
Authored high-quality “golden” reference responses used to train and evaluate large language models across conversational, technical, reasoning, and creative tasks. Responsibilities included interpreting complex instructions, crafting accurate and natural-sounding outputs, and ensuring responses met strict standards for clarity, helpfulness, tone, safety, and instruction adherence.

2024 - 2024

Education

SAIT

Diploma in Computer Engineering Technology
2006 - 2008

Work History

Brandt Tractor

Warehouseperson

Calgary
2024 - Present

Rogers Communications

System Admin Analyst

Calgary
2020 - 2024