AI Data Evaluator — Independent Project
Developed and executed an independent project focused on the hands-on evaluation of AI-generated responses. Assessed AI output across factual accuracy, reasoning, hallucination detection, and completeness dimensions. Provided structured feedback and label verification according to defined evaluation guidelines. • Evaluated multi-step reasoning responses and detected logical errors. • Performed numerical and probability problem-solving to test LLM performance. • Conducted data QA and annotation, verifying label accuracy. • Designed and validated logic problems to assess AI solution quality.