LLM Response Evaluation
Evaluation, rating and justification of LLM responses against various type of prompts related to calendar, reminder, Google Workspace, Google Home and media playback tasks. This task also involves analysis of the tool call, function call, code parameters, sequencing and code output quality.