AI Data Labeling and Model Evaluation
Currently working on AI training data labeling and model evaluation tasks. Classified prompts and responses across multiple categories including intent, topic, and response quality. Evaluated AI generated answers for accuracy, relevance, and clarity. Performed question answering validation by reviewing responses and identifying incorrect or incomplete outputs. Flagged edge cases and ambiguous prompts to improve dataset quality. Worked across different task types including reasoning, summarisation, and general knowledge queries. Maintained consistency by following detailed annotation guidelines and conducting quality checks.