LLM Evaluator
Scale AI Text Evaluation Rating
I provide a prompt to two LLMs and compare their responses on criteria such as localization, instruction following, and truthfulness.
2024 - 2024