Biriba Zb

AI Data Evaluator - Technology & Internet

Merida, Mexico
$8.00/hr
Intermediate
Labelbox

Key Skills

Software

Labelbox

Top Subject Matter

No subject matter listed

Top Data Types

Text

Top Label Types

Classification
Evaluation Rating
Data Collection
Prompt Response Writing SFT
Audio Recording

Freelancer Overview

I am an experienced AI data evaluator and junior data analyst with a strong focus on data labeling, annotation, and improving AI model behavior. My background includes evaluating AI-generated outputs for correctness, safety, and relevance, consistently achieving 95–98% accuracy by following detailed rating guidelines and frameworks. I am skilled in data cleaning, transformation, and visualization using tools like R, SQL, spreadsheets, and ggplot2, and have hands-on experience with annotation tools and prompt evaluation for LLMs. My work in remote environments has honed my attention to detail and reliability, while my customer service experience in e-commerce has strengthened my communication and problem-solving skills. I am passionate about ensuring high-quality training data to support effective AI systems.

Intermediate
English
Spanish

Labeling Experience

Labelbox

Search Relevance & LLM Response Evaluation Specialist

Labelbox
Text
Classification
Evaluation Rating
Worked on large-scale AI training and evaluation projects focused on search relevance, language quality, and LLM response assessment. Responsibilities included rating search results and model-generated responses against detailed guideline frameworks covering relevance, accuracy, helpfulness, safety, and linguistic quality. Performed fine-grained evaluation of model outputs for fluency, tone, coherence, and instruction-following. Identified issues related to ambiguity, hallucinations, bias, and factual inconsistencies. Contributed to improving model performance by providing structured feedback on error patterns and edge cases. Handled high-volume annotation tasks while maintaining strict quality standards and consistency across evolving guideline updates. Regularly worked with nuanced judgment scenarios requiring deep reading comprehension, contextual reasoning, and linguistic sensitivity. Maintained strong quality metrics through adherence to calibration feedback, consensus alignment…

2024

Education

Centro de estudios Educare A.C.

High school degree
2013 - 2016

Work History

Eco-Compu

Founder

Mérida
2024 - Present

Amazon

Customer Service Associate

Mérida
2022 - 2024