Task Contributor / LLM Content Evaluator
As a Task Contributor and LLM Content Evaluator at Clickworker, I assessed large language model (LLM) responses for accuracy, relevance, and adherence to instructions. My responsibilities included side-by-side model comparison, factual correctness verification, and adherence to detailed evaluation guidelines. I performed structured grading using established frameworks and online research to support my judgments. • Evaluated the quality and relevance of AI-generated text responses • Verified factual correctness using research and structured criteria • Performed side-by-side comparisons of multiple language models • Utilized Clickworker's proprietary software for rating and assessment.