For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
I
IVEN CHAVEZ

IVEN CHAVEZ

LLM Evaluation & AI Safety Specialist

USA flagRemote, Usa
$20.00/hrExpertLabelboxRoboflowSnorkel AI

Key Skills

Software

LabelboxLabelbox
RoboflowRoboflow
Snorkel AISnorkel AI
Scale AIScale AI
CVATCVAT
AWS SageMakerAWS SageMaker
Other

Top Subject Matter

AI Evaluation & Safety
Biology Domain Expertise
AI Training

Top Data Types

ImageImage
AudioAudio
TextText
VideoVideo
DocumentDocument

Top Task Types

Bounding BoxBounding Box
Point/Key PointPoint/Key Point
ClassificationClassification
TrackingTracking
TranscriptionTranscription
PolygonPolygon
RLHFRLHF
Entity (NER) ClassificationEntity (NER) Classification
Text GenerationText Generation
Red TeamingRed Teaming
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
Question AnsweringQuestion Answering
Data CollectionData Collection

Freelancer Overview

LLM Evaluation & AI Safety Specialist. Brings 3+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Other. AI-training focus includes data types such as Text and Image and labeling workflows including Evaluation, Rating, and Prompt + Response Writing (SFT).

ExpertPortugueseEnglishGreekSpanishFrench

Labeling Experience

Computer Code Annotation and Evaluation for AI Programming Model Training

Computer Code ProgrammingComputer Programming Coding
Contributed to AI training projects focused on improving code generation and programming assistance models. Responsible for annotating and reviewing programming datasets used to train machine learning systems capable of understanding and generating code. Tasks included evaluating AI-generated code, labeling programming tasks, classifying code functionality, and writing prompt–response examples for supervised fine-tuning of coding models. Reviewed code snippets written in languages such as Python, JavaScript, and SQL, ensuring accuracy, logical correctness, and adherence to coding best practices. Assisted in rating the quality of AI-generated code based on correctness, efficiency, readability, and functionality. Followed detailed project guidelines to ensure consistency across training datasets used for AI coding assistants and automated programming tools. Worked with structured coding datasets and performed quality checks to ensure high-quality annotations used in training and evaluating advanced code generation models.

Contributed to AI training projects focused on improving code generation and programming assistance models. Responsible for annotating and reviewing programming datasets used to train machine learning systems capable of understanding and generating code. Tasks included evaluating AI-generated code, labeling programming tasks, classifying code functionality, and writing prompt–response examples for supervised fine-tuning of coding models. Reviewed code snippets written in languages such as Python, JavaScript, and SQL, ensuring accuracy, logical correctness, and adherence to coding best practices. Assisted in rating the quality of AI-generated code based on correctness, efficiency, readability, and functionality. Followed detailed project guidelines to ensure consistency across training datasets used for AI coding assistants and automated programming tools. Worked with structured coding datasets and performed quality checks to ensure high-quality annotations used in training and evaluating advanced code generation models.

2020 - Present
Snorkel AI

CS/Software Engineering Problem Solving & Evaluation Specialist

Snorkel AI
Independently developed, reviewed, and adjudicated complex computer science and software engineering solutions with an emphasis on algorithms and coding challenges. Managed high-volume datasets and contributed structured feedback for AI training pipelines supporting STEM problem-solving models. Utilized Snorkel AI-style tools and multiple programming languages to author and evaluate production-ready code. • Code review and adjudication to ensure correctness, clarity, and efficiency. • Worked asynchronously up to 40 hours per week on CS/SE evaluation tasks. • Maintained proper version control with Git/GitHub and debugging using Jupyter Notebook. • Ensured alignment with engineering best practices in all labeling and evaluation tasks.

Independently developed, reviewed, and adjudicated complex computer science and software engineering solutions with an emphasis on algorithms and coding challenges. Managed high-volume datasets and contributed structured feedback for AI training pipelines supporting STEM problem-solving models. Utilized Snorkel AI-style tools and multiple programming languages to author and evaluate production-ready code. • Code review and adjudication to ensure correctness, clarity, and efficiency. • Worked asynchronously up to 40 hours per week on CS/SE evaluation tasks. • Maintained proper version control with Git/GitHub and debugging using Jupyter Notebook. • Ensured alignment with engineering best practices in all labeling and evaluation tasks.

2024 - Present
Labelbox

Advanced Video Annotation & Multi-Object Tracking for computer Vision

LabelboxVideoBounding BoxPoint Key Point
Executed large-scale video annotation projectsnputer vision and action recognition models Performed frame-by-frame labeling of dynamic scenes using bounding boxes, polygons, and keypoints while maintaining persistent object IDs for multi-object tracking tasks. Handled complex scenarios including occlusions, motion blur, dense traffic scenes, and rapid object movement Applied strict ontology rules and temporal consistency checks to ensure annotations met production-level Al training standards. Processed thousands of video frames across diverse environments, preparing datasets optimized for YOLO-based object detection and tracking pipelines. Maintained 95% annotation accuracy through structured QA workflows, guideline adherence, and systematic self-review. Collaborated with remote Al teams to resolve edge cases, improve labeling taxonomies, and deliver high quality datasets within tight deadlines.

Executed large-scale video annotation projectsnputer vision and action recognition models Performed frame-by-frame labeling of dynamic scenes using bounding boxes, polygons, and keypoints while maintaining persistent object IDs for multi-object tracking tasks. Handled complex scenarios including occlusions, motion blur, dense traffic scenes, and rapid object movement Applied strict ontology rules and temporal consistency checks to ensure annotations met production-level Al training standards. Processed thousands of video frames across diverse environments, preparing datasets optimized for YOLO-based object detection and tracking pipelines. Maintained 95% annotation accuracy through structured QA workflows, guideline adherence, and systematic self-review. Collaborated with remote Al teams to resolve edge cases, improve labeling taxonomies, and deliver high quality datasets within tight deadlines.

2023 - Present

AI Training Specialist

OtherTextRLHFPrompt Response Writing SFT
Conducted Reinforcement Learning from Human Feedback (RLHF) and AI safety assessments on AI-generated text data in English and Spanish. Performed prompt–response evaluation, NLP tagging, and dataset categorization to improve large language model safety and alignment. Leveraged annotation and evaluation platforms for structured feedback and quality assurance. • Evaluated safety, factual consistency, and bias in model outputs. • Participated in multi-language evaluation for greater model robustness. • Designed systematic approaches for prompt and response quality validation. • Used software platforms to efficiently scale annotation processes.

Conducted Reinforcement Learning from Human Feedback (RLHF) and AI safety assessments on AI-generated text data in English and Spanish. Performed prompt–response evaluation, NLP tagging, and dataset categorization to improve large language model safety and alignment. Leveraged annotation and evaluation platforms for structured feedback and quality assurance. • Evaluated safety, factual consistency, and bias in model outputs. • Participated in multi-language evaluation for greater model robustness. • Designed systematic approaches for prompt and response quality validation. • Used software platforms to efficiently scale annotation processes.

2022 - Present
CVAT

Text Annotation and Prompt–Response Evaluation for AI Model Training

CVATTextClassificationEntity Ner Classification
Contributed to AI training projects focused on improving large language models and conversational AI systems by annotating and evaluating large text datasets. Responsible for labeling text data, classifying content categories, and reviewing prompt–response pairs to improve model accuracy and response quality. Performed text classification, prompt and response writing (SFT), response evaluation, and quality rating tasks according to project guidelines. Analyzed responses generated by AI systems and evaluated them for accuracy, relevance, safety, and linguistic quality. Assisted in generating high-quality prompt–response examples used for supervised fine-tuning of language models. Followed strict annotation guidelines and contributed to building structured datasets used for training and evaluating advanced AI systems.

Contributed to AI training projects focused on improving large language models and conversational AI systems by annotating and evaluating large text datasets. Responsible for labeling text data, classifying content categories, and reviewing prompt–response pairs to improve model accuracy and response quality. Performed text classification, prompt and response writing (SFT), response evaluation, and quality rating tasks according to project guidelines. Analyzed responses generated by AI systems and evaluated them for accuracy, relevance, safety, and linguistic quality. Assisted in generating high-quality prompt–response examples used for supervised fine-tuning of language models. Followed strict annotation guidelines and contributed to building structured datasets used for training and evaluating advanced AI systems.

2022 - Present

Education

U

University of California

Bachelor of Science, Information Technology

Bachelor of Science
2018 - 2022
C

Central Valley High School

High School Diploma, General Education

High School Diploma
2014 - 2018

Work History

E

Einstein Stem Production

Computer Science Problem Design Specialist

Remote
2024 - Present
S

Scale Ai

Ai training expert

Seattle
2021 - 2024