Jorge Diaz - AI Model Trainer – Language Model RLHF

Key Skills

Software

Labelbox

Other

Top Subject Matter

Natural Language Processing

Large Language Models

Autonomous Vehicles

Top Data Types

Text

3D Sensor

Video

Top Task Types

RLHF

Classification

Mapping

Freelancer Overview

AI Model Trainer – Language Model RLHF. Core strengths include Internal, Proprietary Tooling, and Labelbox. AI-training focus includes data types such as Text, 3D Sensor, and Geospatial and labeling workflows including RLHF, Classification, and Mapping.

IntermediateEnglish

Labeling Experience

Multimodal Data Labeler and Prompt Validator

OtherTextClassification

This position entailed categorizing multimodal data and validating complex prompts for generative AI models. The responsibility included checking technical quality and ensuring compliance with established style guides and accuracy metrics. Results directly influenced the reliability and creativity of AI-generated outputs. • Labeled and categorized multimodal datasets for AI training. • Validated and refined prompts for generative models. • Conducted technical quality assurance reviews. • Documented adherence to style and precision standards.

2025 - 2025

Geospatial Data Labeler and Quality Control Specialist

Mapping

In this project, I ensured quality control and verification of geospatial data for updating high-precision digital maps. Efforts involved resolving cartographic discrepancies and validating points of interest using satellite imagery. The process supported the integrity of map data for navigation and geographic applications. • Analyzed and validated geospatial datasets. • Resolved mapping inconsistencies through quality audits. • Labeled and updated digital map features and POIs. • Ensured adherence to mapping standards and protocols.

2025 - 2025

3D Data Annotator – Traffic and Video Labeling

Labelbox3D SensorClassification

This role focused on annotating and labeling traffic signals in 3D environments to train autonomous driving systems. The task required precise analysis of high-speed video sequences for accurate asset identification. Labelers worked with complex scenes under varying conditions to ensure reliable model training. • Annotated 3D sensor data for object recognition. • Classified and labeled traffic signs and environmental assets. • Verified labeling accuracy through quality checks. • Collaborated in scenarios with diverse road and lighting conditions.

2024 - 2024

AI Model Trainer – Language Model RLHF

TextRLHF

This experience involved evaluating and optimizing responses from large language models (LLM) using Reinforcement Learning from Human Feedback (RLHF). Tasks included rating, ranking, and refining AI-generated outputs across diverse prompts and scenarios. Intensive quality control ensured the accuracy and consistency of training data throughout the process. • Conducted linguistic evaluation of LLM outputs. • Provided human feedback to train and fine-tune responses. • Followed strict guidelines to enhance output relevance and coherence. • Maintained records of feedback efficiency and model improvement.

2023 - 2024

Education

M

Monte carmelo

Secundaria, Bachiller

Secundaria

2014 - 2019

Work History

R

Remotask

Tasker

Ciudad Guayana

2023 - 2025