Generalist Expert
Text Evaluation Rating
The project involves evaluating model responses to prompts using several methods, including rubric-based scoring and ranking.
2026 - Present