Safety & Policy Data Tagging
Tagged and categorized user-generated content under safety policies (self-harm, hate speech, misinformation, discrimination). Applied multi-layered classification rules and maintained high inter-annotator agreement rates to support policy-compliant model training.