For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Jose Luis Sanchez Juarez

Jose Luis Sanchez Juarez

LLM Evaluation & AI Data Annotation Specialist (English & Spanish)

Mexico flagAcapulco, Mexico
$18.00/hrIntermediateAppenCrowdsourceData Annotation Tech

Key Skills

Software

AppenAppen
CrowdSourceCrowdSource
Data Annotation TechData Annotation Tech
LabelboxLabelbox
OneFormaOneForma
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Classification
Evaluation Rating
RLHF
Text Generation
Translation Localization

Freelancer Overview

I have experience in AI data labeling, LLM evaluation, and search ranking assessments. I have contributed to multiple AI training projects involving preference ranking, truthfulness analysis, instruction following, and Reinforcement Learning from Human Feedback (RLHF). I have worked on popular platforms refining AI-generated responses for accuracy, fluency, and relevance. Additionally, I have evaluated search quality by assessing query relevance and ranking results to enhance AI-driven search engines. My expertise in English and Spanish NLP further strengthens my ability to provide high-quality annotations and linguistic assessments.

IntermediateEnglishSpanish

Labeling Experience

Data Annotation Tech

Search Quality & Query Evaluation

Data Annotation TechText
I have performed AI model evaluations using HELM (Holistic Evaluation of Language Models), and GEE (Generalized Evaluation for AI Systems) to assess response truthfulness, instruction following, coherence, specificity, fluency, and reasoning. Conducted structured pairwise ranking and fine-grained scoring to refine AI-generated outputs. Provided expert linguistic assessments and model improvement suggestions to enhance conversational AI performance and user satisfaction.

I have performed AI model evaluations using HELM (Holistic Evaluation of Language Models), and GEE (Generalized Evaluation for AI Systems) to assess response truthfulness, instruction following, coherence, specificity, fluency, and reasoning. Conducted structured pairwise ranking and fine-grained scoring to refine AI-generated outputs. Provided expert linguistic assessments and model improvement suggestions to enhance conversational AI performance and user satisfaction.

2024
Data Annotation Tech

LLM Evaluation & AI Response Ranking

Data Annotation TechTextEvaluation RatingPrompt Response Writing SFT
Performed in-depth AI model evaluations using Fine-Grained Criteria, assessing truthfulness, instruction following, coherence, specificity, fluency, and reasoning. Ranked AI-generated responses based on clarity, conciseness, and alignment with user intent. Provided structured preference rankings and qualitative feedback to enhance model performance and fine-tune language model outputs for improved interaction quality.

Performed in-depth AI model evaluations using Fine-Grained Criteria, assessing truthfulness, instruction following, coherence, specificity, fluency, and reasoning. Ranked AI-generated responses based on clarity, conciseness, and alignment with user intent. Provided structured preference rankings and qualitative feedback to enhance model performance and fine-tune language model outputs for improved interaction quality.

2024

Education

U

Universidad Nacional Autónoma de México

Bachelor's in Computer Engineering, Computer Engineering

Bachelor's in Computer Engineering
2017 - 2022

Work History

A

Arcus

Customer Experience Analyst

Acapulco
2024 - Present
A

Arcus

Assistant Product Owner

Acapulco
2023 - 2024