For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
R
Rodriguez Gabriel

Rodriguez Gabriel

Full stack software Engineer | AI Model Evaluator (RLHF & LLM Safety)

USA flagRemote, Usa
$20.00/hrExpertScale AILabel StudioCVAT

Key Skills

Software

Scale AIScale AI
Label StudioLabel Studio
CVATCVAT
LabelboxLabelbox
ProdigyProdigy

Top Subject Matter

Large Language Models (LLMs)
AI Safety
Prompt Engineering

Top Data Types

TextText
ImageImage

Top Task Types

RLHFRLHF
Data CollectionData Collection

Freelancer Overview

AI Model Evaluator (RLHF & LLM Safety). Brings 6+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Scale AI, Label Studio, and CVAT. Education includes Master of Science, Stanford University (2024) and Bachelor of Science, University of California, Berkeley (2022). AI-training focus includes data types such as Text and labeling workflows including RLHF and Data Collection.

ExpertEnglishSpanishSwahili

Labeling Experience

Scale AI

AI Model Evaluator (RLHF & LLM Safety)

Scale AITextRLHF
In this role, I evaluated and ranked large language model outputs for correctness, coherence, factual accuracy, and safety using RLHF frameworks. I designed prompt engineering strategies to test LLM reasoning, robustness, and instruction adherence. I also conducted adversarial testing and red teaming to identify vulnerabilities and provided structured feedback for alignment and RL pipelines. • Evaluated LLM outputs across various tasks including code and instruction following • Developed methodologies for adversarial testing and safety assessment • Leveraged prompt engineering in AI evaluation • Delivered structured evaluations to improve model alignment and safety

In this role, I evaluated and ranked large language model outputs for correctness, coherence, factual accuracy, and safety using RLHF frameworks. I designed prompt engineering strategies to test LLM reasoning, robustness, and instruction adherence. I also conducted adversarial testing and red teaming to identify vulnerabilities and provided structured feedback for alignment and RL pipelines. • Evaluated LLM outputs across various tasks including code and instruction following • Developed methodologies for adversarial testing and safety assessment • Leveraged prompt engineering in AI evaluation • Delivered structured evaluations to improve model alignment and safety

2024 - Present
Label Studio

AI Trainer & Data Annotation Specialist

Label StudioTextData Collection
As an AI Trainer and Data Annotation Specialist, I annotated and curated large-scale datasets for training LLMs across text, image, audio, and video modalities. I performed quality evaluation of AI-generated outputs for accuracy, reasoning, and instruction adherence. I also developed structured evaluation rubrics for benchmarking and supported multimodal AI pipelines. • Worked with text, image, audio, and video datasets for LLM training • Ensured dataset quality and completeness • Developed rubrics for model and dataset evaluation • Supported pipelines for NLP, computer vision, and audio processing

As an AI Trainer and Data Annotation Specialist, I annotated and curated large-scale datasets for training LLMs across text, image, audio, and video modalities. I performed quality evaluation of AI-generated outputs for accuracy, reasoning, and instruction adherence. I also developed structured evaluation rubrics for benchmarking and supported multimodal AI pipelines. • Worked with text, image, audio, and video datasets for LLM training • Ensured dataset quality and completeness • Developed rubrics for model and dataset evaluation • Supported pipelines for NLP, computer vision, and audio processing

2023 - 2024

Education

S

Stanford University

Master of Science, Artificial Intelligence

Master of Science
2022 - 2024
U

University of California, Berkeley

Bachelor of Science, Computer Science

Bachelor of Science
2018 - 2022

Work History

S

Scale AI

AI Model Evaluator

Remote
2024 - Present
H

Handshake AI

Full Stack Software Engineer

Remote
2022 - 2023