Action Item
I worked on a project focused on evaluating and improving an AI system’s ability to extract accurate action items from meeting transcripts. My responsibilities included reviewing AI-generated action items, verifying whether they were correctly derived from the transcript, and determining when items should be added, removed, or rewritten for clarity and correctness. In addition to identifying and validating action items, I assessed the model’s outputs along key dimensions such as truthfulness, groundedness, and instruction following. This involved checking whether each action item was fully supported by the transcript, ensuring there were no hallucinations or fabricated tasks, and confirming that the model followed formatting rules, task-extraction guidelines, and enumeration instructions. I evaluated completeness, relevance, logical consistency, and alignment with the original discussion while ensuring compliance with project standards. My work contributed to improving the model’s re