LLM Evaluation Contributor in Japanese
Scale AI, Text, Text Generation, Evaluation Rating
A/B rating of the responses generated for each prompt. Following the guidelines, I assessed each response's accuracy, responsiveness to the prompt, and usefulness.
2024