Baldcypress
The purpose of this independent task is to evaluate the quality of AI-generated responses according to specific quality standards. The responses address the given question or prompt (both presented in the target language) and are assessed based on standardized quality evaluation criteria, which include ten distinct quality labels, including Safety,Accuracy,Language Alignment, Instruction Following, Relevance, Refusal, Comprehensiveness, Grammar&Fluency,Tone,Formating.