Candidates should have a Bachelor's degree or higher in a relevant field such as Linguistics, Psychology, Law, Security, or Communications, or equivalent professional experience. Expert-level Spanish (near-native or native) and C1+ English proficiency are required. At least 5 years of experience in Trust & Safety, policy operations, or similar, plus documented LLM adversarial testing and localization experience are mandatory. Emotional resilience and ability to handle sensitive content are critical. In this project, experts will assess and label AI-generated outputs in Spanish and English, focusing on safety, correctness, and clarity. Tasks include spotting conceptual or policy errors, performing red-teaming to challenge system robustness, and rating responses based on policy alignment. You will annotate explicit content categories to improve large-model safety and performance.
$1,400‑$2,400
$14‑$24/hr
Flexible
3-6 months
10
AI-generated text and safety scenarios in Spanish and English
Software
Hiring Type
Required Location
Workload / Schedule
Weekly commitment can be adjusted based on throughput targets. Project duration is expected to run for 3 to 6 months. Labelers should follow milestone deadlines and quality checkpoints.
Software
Data Type
Task Types
Subject Matter / Industry
Language
Proposals: 196
Invites sent: 0
Unanswered invites: 0
Share link