IT Operations & Support / Data Annotation & AI Training
Design, review, and validate evaluation scenarios for autonomous AI agents, ensuring tests are logically sound, realistic, and aligned with intended agent behaviors. Reason about complex systems and policies as a human reviewer to guarantee agents are evaluated against clear, robust standards.