LLM Evaluator
Evaluated large language model outputs by checking transcription accuracy, comparing responses to identical prompts by scoring outputs using a predefined Likert scale. Provided objective evaluations to refine LLM performance and ensured data consistency through careful review and analysis.