AI Trainer
Designed complex reasoning-based prompts and high-quality ground-truth answers to evaluate and improve AI agent performance. • Identified core cognitive skills under evaluation, including multi-step reasoning, nuance detection, and cross-source synthesis. • Created structured evaluation checklists and benchmarked responses from leading LLMs to ensure accuracy, consistency, and quality.