AI Model Response Evaluator
Conducted side-by-side evaluations of AI model responses in DataCompute Workbench, assessing each output for accuracy, clarity, and adherence to the prompt. Provided structured feedback to improve LLM performance and maintain data quality for AI training. Followed defined quality guidelines to keep evaluations consistent and reliable.