For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Pedro Deniz

Pedro Deniz

Expert in Software development, Securtiy and AWS

Spain flagMadrid, Spain
$30.00/hrIntermediateData Annotation TechLabelboxMindrift

Key Skills

Software

Data Annotation TechData Annotation Tech
LabelboxLabelbox
MindriftMindrift

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
ImageImage
TextText

Top Task Types

Computer Programming Coding
Evaluation Rating

Freelancer Overview

I have mainly worked in AI training comparing different models responses to code related tasks. This work involved asking two different models to perform a code task such as: new request feature, creation of tests, refactoring, creation of documentation among others. Also, I have tested security testing of models, having bypassed a model security boundaries and using it to download and execute custom created malware. I have also worked in evaluating text model responses in terms of factuality, instruction following, writing tone and other aspects depending on the customer's request. This work also included comparing the responses of two models to the same prompt.

IntermediateEnglishSpanish

Labeling Experience

Labelbox

Python LLM response comparison

LabelboxComputer Code Programming
This project´s objective was to compare the coding skills of two models by asking the trainer to ask them to perform the same task over several prompt-response turns and evaluate both model´s solutions

This project´s objective was to compare the coding skills of two models by asking the trainer to ask them to perform the same task over several prompt-response turns and evaluate both model´s solutions

2025
Labelbox

Evaluating model's Function calling skills

LabelboxAudioFunction Calling
This project evaluated the accuracy of a model to call the appropriate functions to retrieve the information requested by a user. Depending on the context (healthcare or customer support) there were a set of functions which the method could call. I needed to evaluate that the model called the right functions, at the right time and with the right parameters.

This project evaluated the accuracy of a model to call the appropriate functions to retrieve the information requested by a user. Depending on the context (healthcare or customer support) there were a set of functions which the method could call. I needed to evaluate that the model called the right functions, at the right time and with the right parameters.

2025 - 2025
Mindrift

Anthropic Claude security testing

MindriftComputer Code ProgrammingComputer Programming Coding
The project was testing how well Anthropic's Claude security boundaries prevented using the model to be used to download and install malware using public social network profiles.

The project was testing how well Anthropic's Claude security boundaries prevented using the model to be used to download and install malware using public social network profiles.

2025 - 2025

Education

C

Concordia University

M(Eng.) in Information Systems Security, Computer Systems Security

M(Eng.) in Information Systems Security
2014 - 2015
C

Concordia University

Master of Engineering, Information Systems Security

Master of Engineering
2014 - 2015

Work History

A

Activex Servicios Integrales

AWS Security Architect

N/A
2022 - Present
T

Tata Consultancy Services

AWS Security Architect

N/A
2021 - 2022