Search Relevance & LLM Response Evaluation Specialist
Worked on large-scale AI training and evaluation projects focused on search relevance, language quality, and LLM response assessment. Responsibilities included rating search results and model-generated responses against detailed guideline frameworks covering relevance, accuracy, helpfulness, safety, and linguistic quality. Performed fine-grained evaluation of model outputs for fluency, tone, coherence, and instruction-following, identifying issues related to ambiguity, hallucinations, bias, and factual inconsistency. Contributed to improving model performance by providing structured feedback on error patterns and edge cases. Handled high-volume annotation tasks while maintaining strict quality standards and consistency across evolving guideline updates. Regularly worked with nuanced judgment scenarios requiring deep reading comprehension, contextual reasoning, and linguistic sensitivity. Maintained strong quality metrics through adherence to calibration feedback and consensus alignment.