For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
K

Karen Noyola

AI Language Evaluator (Contract / Project-Based)

USA flag
Dallas, Usa
$25.00/hrExpert

Key Skills

Software

No software listed

Top Subject Matter

AI Language Modeling
LLM Evaluation
Safety Assessment

Top Data Types

TextText
DocumentDocument
VideoVideo

Top Task Types

Object Detection
Question Answering
Text Summarization
Text Generation
Fine Tuning
Evaluation Rating

Freelancer Overview

AI Language Evaluator (Contract / Project-Based). Brings 6+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Master of Arts, Texas State University (2017) and Bachelor of Arts, University of South Florida (2015). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

ExpertSwahiliEnglish

Labeling Experience

AI Language Evaluator (Contract / Project-Based)

Text
As an AI Language Evaluator at Welocalize, contributed to AI training and model reliability through systematic prompt authoring, response evaluation, and fact verification. Consistently reviewed and rated AI-generated English and multilingual outputs for clarity, correctness, and safety alignment. Applied evolving guidelines for annotation and evaluation to ensure high data quality and inter-reviewer consistency. • Conducted adversarial testing by stress-testing models with ambiguous and edge-case prompts. • Delivered detailed written justifications for evaluation decisions supporting iterative quality improvements. • Flagged and documented systematic weaknesses in LLM behaviour such as overgeneralisation and cultural misalignment. • Performed side-by-side ranking of model outputs to assess tone drift, logical consistency, and safety risks.

As an AI Language Evaluator at Welocalize, contributed to AI training and model reliability through systematic prompt authoring, response evaluation, and fact verification. Consistently reviewed and rated AI-generated English and multilingual outputs for clarity, correctness, and safety alignment. Applied evolving guidelines for annotation and evaluation to ensure high data quality and inter-reviewer consistency. • Conducted adversarial testing by stress-testing models with ambiguous and edge-case prompts. • Delivered detailed written justifications for evaluation decisions supporting iterative quality improvements. • Flagged and documented systematic weaknesses in LLM behaviour such as overgeneralisation and cultural misalignment. • Performed side-by-side ranking of model outputs to assess tone drift, logical consistency, and safety risks.

2020 - Present

Education

T

Texas State University

Master of Arts, Linguistics

Master of Arts
2015 - 2017
U

University of South Florida

Bachelor of Arts, Linguistics and International Studies

Bachelor of Arts
2011 - 2015

Work History

M

Multilateral Development Programme

Language Analyst and Content Reviewer

Austin
2017 - 2020
U

University of South Florida

University Instructor, Linguistics and Language Use

Tampa
2015 - 2017