AI Response Evaluation
I evaluated AI responses in Korean using sets of criteria given by the client. On the projects, I evaluated the AI responses on the grounds of truthfulness, which checks the factuality of the information found in the response and instruction following, which pertains to whether the response addressed the prompt's explicit and implicit intentions. I mostly did evaluation tasks for different formats such as text, image, and voice responses.