Project Horizon - Expert Workflow Benchmarking
Demonstrate how real professionals execute domain workflows and compare that output against AI model responses. Produce expert-level prompts, responses, and evaluations used to benchmark and fine-tune frontier AI models.