AI Agent Evaluation & Training
Review and evaluate user–agent conversation trajectories for AI systems in task-execution and job-management scenarios. Assess agent behavior against defined rules and quality guidelines, create scoring criteria and rationales, and correct AI outputs to improve accuracy, compliance, and overall performance.