Project Blackbeard -- Outlier AI
TextEvaluation Rating
Evaluating responses from frontier AI models to STEM prompts spanning biochemistry, chemistry, and finance. The project specifically aimed to refine model responses to avoid hallucinations during complex STEM reasoning workflows.
Evaluating responses from frontier AI models to STEM prompts spanning biochemistry, chemistry, and finance. The project specifically aimed to refine model responses to avoid hallucinations during complex STEM reasoning workflows.
2026 - Present