RWS Bard Rating
Evaluated LLM-generated responses for quality dimensions including truthfulness, writing quality, and prompt compliance. Performed fact-checking and comparative assessments of responses using reputable sources. Assessed various evaluation types including side-by-side comparisons and specialized factuality tasks with focus on accuracy verification.