For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Rodrigo Santesteban

Rodrigo Santesteban

LLM Evaluation & Data Labeling Specialist | Multilingual (5 languages)

Spain flagPamplona, Spain
$20.00/hrIntermediateLabelboxRemotasksScale AI

Key Skills

Software

LabelboxLabelbox
RemotasksRemotasks
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
Computer Code ProgrammingComputer Code Programming
VideoVideo

Top Task Types

Computer Programming Coding
Prompt Response Writing SFT
Text Generation

Freelancer Overview

I have practical experience in AI training data, focusing on LLM evaluation, prompt engineering, and multilingual text annotation. I’ve collaborated with U.S. teams to assess model outputs, refine prompts, and ensure high-quality, instruction-aligned responses. My 12-year background in journalism gives me strong analytical and content-review skills, while my technical experience in Node.js, Python, and TypeScript helps me understand how annotated data powers AI systems. I work fluently in five languages, allowing me to contribute effectively across diverse NLP tasks.

IntermediateFrenchGermanEnglishSpanishPortuguese

Labeling Experience

Labelbox

LLM Code Evaluation & Annotation with Labelbox

LabelboxComputer Code ProgrammingText GenerationEvaluation Rating
Worked on a coding-focused LLM project using Labelbox, evaluating model-generated code and programming tasks in Java, Go, and neural network implementations. Rated code for correctness, efficiency, and instruction compliance, created prompt–response pairs for fine-tuning, and validated function-calling outputs. Ensured high-quality annotations through strict technical guidelines and consistency checks across large-scale datasets.

Worked on a coding-focused LLM project using Labelbox, evaluating model-generated code and programming tasks in Java, Go, and neural network implementations. Rated code for correctness, efficiency, and instruction compliance, created prompt–response pairs for fine-tuning, and validated function-calling outputs. Ensured high-quality annotations through strict technical guidelines and consistency checks across large-scale datasets.

2024
Scale AI

LLM Code Evaluation & Programming Task Annotation at Scale AI

Scale AIComputer Code ProgrammingRelationshipClassification
Worked on a coding-focused LLM training project, evaluating model-generated code and programming tasks in Python, JavaScript/TypeScript, Node.js, and PHP. Rated code for correctness, efficiency, and instruction compliance, created prompt–response pairs for fine-tuning, and validated function-calling outputs, ensuring high-quality annotations through strict technical guidelines and consistency checks.

Worked on a coding-focused LLM training project, evaluating model-generated code and programming tasks in Python, JavaScript/TypeScript, Node.js, and PHP. Rated code for correctness, efficiency, and instruction compliance, created prompt–response pairs for fine-tuning, and validated function-calling outputs, ensuring high-quality annotations through strict technical guidelines and consistency checks.

2024

Education

S

Soy Henry

Full Stack Developer, Full Stack Development

Full Stack Developer
2023 - 2023
F

FreeCodeCamp

Back End Developer, Back End Development

Back End Developer
2023

Work History

S

Scale IA

AI Developer

Pamplona
2024 - Present
C

Coderhouse

Back End Course Professor

Pamplona
2023 - Present