For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
J
Jorge Diaz

Jorge Diaz

AI Model Trainer – Language Model RLHF

Brazil flagPalhoca, Brazil
$10.00/hrIntermediateLabelboxOther

Key Skills

Software

LabelboxLabelbox
Other

Top Subject Matter

Natural Language Processing
Large Language Models
Autonomous Vehicles

Top Data Types

TextText
3D Sensor
VideoVideo

Top Task Types

RLHFRLHF
ClassificationClassification
MappingMapping

Freelancer Overview

AI Model Trainer – Language Model RLHF. Core strengths include Internal, Proprietary Tooling, and Labelbox. AI-training focus includes data types such as Text, 3D Sensor, and Geospatial and labeling workflows including RLHF, Classification, and Mapping.

IntermediateEnglish

Labeling Experience

Multimodal Data Labeler and Prompt Validator

OtherTextClassification
This position entailed categorizing multimodal data and validating complex prompts for generative AI models. The responsibility included checking technical quality and ensuring compliance with established style guides and accuracy metrics. Results directly influenced the reliability and creativity of AI-generated outputs. • Labeled and categorized multimodal datasets for AI training. • Validated and refined prompts for generative models. • Conducted technical quality assurance reviews. • Documented adherence to style and precision standards.

This position entailed categorizing multimodal data and validating complex prompts for generative AI models. The responsibility included checking technical quality and ensuring compliance with established style guides and accuracy metrics. Results directly influenced the reliability and creativity of AI-generated outputs. • Labeled and categorized multimodal datasets for AI training. • Validated and refined prompts for generative models. • Conducted technical quality assurance reviews. • Documented adherence to style and precision standards.

2025 - 2025

Geospatial Data Labeler and Quality Control Specialist

Mapping
In this project, I ensured quality control and verification of geospatial data for updating high-precision digital maps. Efforts involved resolving cartographic discrepancies and validating points of interest using satellite imagery. The process supported the integrity of map data for navigation and geographic applications. • Analyzed and validated geospatial datasets. • Resolved mapping inconsistencies through quality audits. • Labeled and updated digital map features and POIs. • Ensured adherence to mapping standards and protocols.

In this project, I ensured quality control and verification of geospatial data for updating high-precision digital maps. Efforts involved resolving cartographic discrepancies and validating points of interest using satellite imagery. The process supported the integrity of map data for navigation and geographic applications. • Analyzed and validated geospatial datasets. • Resolved mapping inconsistencies through quality audits. • Labeled and updated digital map features and POIs. • Ensured adherence to mapping standards and protocols.

2025 - 2025
Labelbox

3D Data Annotator – Traffic and Video Labeling

Labelbox3D SensorClassification
This role focused on annotating and labeling traffic signals in 3D environments to train autonomous driving systems. The task required precise analysis of high-speed video sequences for accurate asset identification. Labelers worked with complex scenes under varying conditions to ensure reliable model training. • Annotated 3D sensor data for object recognition. • Classified and labeled traffic signs and environmental assets. • Verified labeling accuracy through quality checks. • Collaborated in scenarios with diverse road and lighting conditions.

This role focused on annotating and labeling traffic signals in 3D environments to train autonomous driving systems. The task required precise analysis of high-speed video sequences for accurate asset identification. Labelers worked with complex scenes under varying conditions to ensure reliable model training. • Annotated 3D sensor data for object recognition. • Classified and labeled traffic signs and environmental assets. • Verified labeling accuracy through quality checks. • Collaborated in scenarios with diverse road and lighting conditions.

2024 - 2024

AI Model Trainer – Language Model RLHF

TextRLHF
This experience involved evaluating and optimizing responses from large language models (LLM) using Reinforcement Learning from Human Feedback (RLHF). Tasks included rating, ranking, and refining AI-generated outputs across diverse prompts and scenarios. Intensive quality control ensured the accuracy and consistency of training data throughout the process. • Conducted linguistic evaluation of LLM outputs. • Provided human feedback to train and fine-tune responses. • Followed strict guidelines to enhance output relevance and coherence. • Maintained records of feedback efficiency and model improvement.

This experience involved evaluating and optimizing responses from large language models (LLM) using Reinforcement Learning from Human Feedback (RLHF). Tasks included rating, ranking, and refining AI-generated outputs across diverse prompts and scenarios. Intensive quality control ensured the accuracy and consistency of training data throughout the process. • Conducted linguistic evaluation of LLM outputs. • Provided human feedback to train and fine-tune responses. • Followed strict guidelines to enhance output relevance and coherence. • Maintained records of feedback efficiency and model improvement.

2023 - 2024

Education

M

Monte carmelo

Secundaria, Bachiller

Secundaria
2014 - 2019

Work History

R

Remotask

Tasker

Ciudad Guayana
2023 - 2025