AI Domain Expert/PhD Scholar (AI-generated content evaluator & LLM output assessor)
I evaluated AI-generated content and assessed Large Language Model outputs for accuracy and reasoning quality. My work included designing complex question-answer datasets and providing structured feedback to improve AI model performance. I utilized AI tools such as ChatGPT, Claude, and Gemini for these academic and research tasks.
• Evaluated AI-generated responses for correctness, coherence, and relevance.
• Designed question-answer pairs for LLM benchmarking in academic domains.
• Provided detailed ratings and feedback on model outputs for quality improvement.
• Applied prompt engineering strategies for research and teaching use cases.