Rubric Editing and Evaluation Instruction review for Failure Axis Alignment of LLM Chatbots
Review multi-turn conversations, evaluate pre-provided rubric criteria against three LLM chatbot model responses, edit rubrics when necessary, and rate each rubric on quality dimensions.