Multilingual Toxicity Detection Labeler
For the Multilingual Toxicity Detection Dataset, I labeled over 25,000 social media text samples in English and Pidgin for hate speech, harassment, and toxicity. I applied nuanced severity categorization, maintaining high individual auditing accuracy. My annotation work enabled a clean dataset release for model training. • Labeled data across multiple languages for broader coverage. • Achieved a 97.3% final accuracy score on audit evaluations. • Enhanced dataset quality with nuanced severity tiering. • Contributed to a high-fidelity, released training set.