Independent Red Teaming Researcher
Independently designed red teaming experiments exploring adversarial and edge-case prompt patterns for GPT-4, Claude 3, and Llama models. Analyzed response behavior to adversarial scenarios, instruction injection, and multi-turn escalation. Documented and catalogued findings to understand model vulnerabilities and performance variations. • Created categorized prompt catalog documenting observed outputs and behavior differences. • Focused on role-play scenarios and instruction/context manipulation challenges. • Maintained detailed reproduction notes for all adversarial tests conducted. • Ongoing personal project tracked with regular updates and practical experiments.