AI Safety Red Teaming | Contract Consultant
As a contract consultant, I generated adversarial prompts targeting edge cases in AI safety classifiers. I applied category and severity labels grounded in policy guidelines and authored concise rationales for decisions. I performed high-volume labeling and reviewed peer submissions for accuracy and consistency.
• Designed and executed adversarial prompt-based red teaming for safety classifiers.
• Applied structured safety classification and severity scoring per requirements.
• Composed policy-grounded rationales to justify classification choices.
• Reviewed labeling outputs of others to ensure consistency and quality.