Posted: Apr 3, 2026

Bilingual AI Safety Data Evaluator (English/Spanish C1+)

OtherTextEvaluation/RatingSpanishPay Per Hour

Overview

Dataset

Labeling Details

Hiring

Budget

Client

Job Description

Project Overview

Candidates should have a Bachelor's degree or higher in a relevant field such as Linguistics, Psychology, Law, Security, or Communications, or equivalent professional experience. Expert-level Spanish (near-native or native) and C1+ English proficiency are required. At least 5 years of experience in Trust & Safety, policy operations, or similar, plus documented LLM adversarial testing and localization experience are mandatory. Emotional resilience and ability to handle sensitive content are critical. In this project, experts will assess and label AI-generated outputs in Spanish and English, focusing on safety, correctness, and clarity. Tasks include spotting conceptual or policy errors, performing red-teaming to challenge system robustness, and rating responses based on policy alignment. You will annotate explicit content categories to improve large-model safety and performance.

Estimated Total Project Earnings: $1,400‑$2,400Intermediate3-6 monthsIndependent AI trainers

Estimated Total Project Earnings

$1,400‑$2,400

Pay per Hour

$14‑$24/hr

Time Requirement

Flexible

Duration

3-6 months

Labelers Needed

Description of dataset

AI-generated text and safety scenarios in Spanish and English

Software

Other

Hiring Type

Independent AI trainers

Required Location

Global - Any Location

Workload / Schedule

Weekly commitment can be adjusted based on throughput targets. Project duration is expected to run for 3 to 6 months. Labelers should follow milestone deadlines and quality checkpoints.

Software

Other

Data Type

Text

Task Types

Evaluation/Rating

RLHF

Text Generation

Subject Matter / Industry

AI safety and LLM content evaluation (Spanish & English)

Language

Spanish

Activity on this project

Proposals: 196

Invites sent: 0

Unanswered invites: 0

Share this project

Share link