AI Generalist Trainer
Text Evaluation Rating
Rated pairs of AI responses on six different rating scales to determine which response was of higher quality, for both text- and image-based questions.
2026 - 2026