Social Media Toxicity & Hate Speech Annotation
Annotated social media comments and posts to detect and classify various forms of toxic content such as threats, obscene language, insults, and identity-based hate. Each record was evaluated for multiple toxicity subtypes, with precise classification to support fine-grained machine learning model training. Annotations guided content moderation systems to automatically flag and review harmful or policy-violating user-generated content, ultimately supporting safer digital communities. Special attention was given to ambiguous cases via consensus annotation and context-based decisions