AI evaluator
Data Annotation Tech, Text, Fine Tuning, Evaluation Rating
Several projects aimed at fine-tuning and evaluating different LLMs. Evaluation criteria differed by project and included truthfulness, instruction following, and localization.
2024 - 2025