Multi-Modal Video Q&A Evaluation & Hallucination Correction
Evaluated a Multi-Modal Large Language Model (MLLM) for video understanding in daily-life scenarios. The workflow centered on video grounding: I analyzed each video clip and verified that the AI-generated question-and-answer (Q&A) pairs were accurate with respect to the clip's content.