Candidates must have a degree in Computer Science or a related field and at least two years of professional, research, or teaching experience, with demonstrated proficiency in Python for numerical validation or simulation (using libraries such as NumPy, pandas, and SciPy). Strong technical writing skills in English at C1 level or higher, experience with structured evaluation of multi-step reasoning, and the ability to analyze correctness, assumptions, and constraints are essential. Familiarity with at least one additional programming language or scientific computing tool is a plus.
You will design rigorous, industry-relevant computer science problems, evaluate AI-generated solutions against structured criteria, validate answers with Python, and contribute expert feedback to improve AI logic and reasoning. The role involves collaborating with a global expert community to keep evaluation standards consistent and high-quality and to maintain scientific integrity across projects.
Estimated Total Earnings
$800.00
Pay per Hour
$40.00/hr
Time Requirement
20+ hrs/week
Duration
3-6 months
Computer science problem design and evaluation
Software
Workload / Schedule
The expected weekly commitment is 20 hours, and the project is expected to last 3-6 months. Labelers should follow milestone deadlines and quality checkpoints.