Abandoned Chat Analysis with AWS SageMaker Ground Truth
Led end-to-end data labeling project analyzing 50,000+ virtual chatbot transcripts to identify and classify chat abandonment patterns. Implemented AWS SageMaker Ground Truth workflows to create high-quality labeled training datasets for multi-class classification of abandonment reasons including: technical issues, user frustration, resolved queries, and genuine abandonment. Scope & Tasks Performed: - Set up custom labeling workflows and clear annotation guidelines - Coordinated annotation team to label chat transcripts by abandonment type - Applied quality control checks to ensure consistent and accurate labeling - Used active learning to automatically label similar data and reduce manual work - Created test datasets and monitored labeling quality throughout the project Project Size: 50,000+ chat transcripts across 4 abandonment categories