For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Adrián Valbuena

Adrián Valbuena

LLM Evaluator, software developer and data analyst

Spain flagMadrid, Spain
$25.00/hrIntermediateAppenOneformaRemotasks

Key Skills

Software

AppenAppen
OneFormaOneForma
RemotasksRemotasks
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

ImageImage
TextText

Top Task Types

Computer Programming Coding
Data Collection
Evaluation Rating
Fine Tuning
RLHF

Freelancer Overview

Some of the projects I was involved in focused on inducing LLM models to produce failures, followed by evaluating and labeling their responses. I also have experience in projects related to response-safety, where I have worked assessing safety aspects of LLMs, labeling potentially harmful outputs, testing models against harmful requests, and refining outcomes to ensure safe responses.

IntermediateEnglishSpanishNorwegian

Labeling Experience

Scale AI

Evaluating responses

Scale AITextQuestion AnsweringText Summarization
In this project, I rate responses to prompts that are already written. The responses have to be evaluated on multiple dimensions and rewritten when necessary. Some of these dimensions are localization, writing quality, instruction following and truthfulness.

In this project, I rate responses to prompts that are already written. The responses have to be evaluated on multiple dimensions and rewritten when necessary. Some of these dimensions are localization, writing quality, instruction following and truthfulness.

2024
Scale AI

Safety

Scale AITextClassificationQuestion Answering
In this project I have created hundreds of prompts to induce responses in which the model failed to comply with safety standards. After this first step was achieved I had to rate and evaluate the response on multiple dimensions. Once this was done, the response was rewritten to meet all the requirements and standards. All the tasks were done in native Spanish (Spain) and rated and evaluated in English.

In this project I have created hundreds of prompts to induce responses in which the model failed to comply with safety standards. After this first step was achieved I had to rate and evaluate the response on multiple dimensions. Once this was done, the response was rewritten to meet all the requirements and standards. All the tasks were done in native Spanish (Spain) and rated and evaluated in English.

2024
Scale AI

RLHF

Scale AITextClassificationText Generation
In this project I have created hundreds of prompts to induce model errors. After this first step was achieved I had to rate and evaluate the response on multiple dimensions. Once this was done, the response was rewritten to meet all the requirements and standards. All the tasks were done in native Spanish (Spain) and rated and evaluated in English.

In this project I have created hundreds of prompts to induce model errors. After this first step was achieved I had to rate and evaluate the response on multiple dimensions. Once this was done, the response was rewritten to meet all the requirements and standards. All the tasks were done in native Spanish (Spain) and rated and evaluated in English.

2024

Education

A

Analistas Financieros Internacionales

Master's degree studies in Data Analytics and Visualization, Data Science

Master's degree studies in Data Analytics and Visualization
2022 - 2023
U

University of Alcalá

Bachelor's degree on Economics and International Business, Economics and Business Administration

Bachelor's degree on Economics and International Business
2016 - 2021

Work History

O

Outlier

Data labeling

Madrid
2024 - Present
A

Analistas Financieros Internacionales

Consultant on data analysis - EU public funds

Madrid
2022 - Present