Project Neon Origin
- Evaluated AI model responses in their adherence to Safety Frameworks and general prompt responses to Red-teaming prompts. - Analyzed prompt conversation history and system instructions to assess response quality. - Documented strengths, weaknesses, and provided structured quality scores for model outputs. - Ranked model responses based on overall performance. - Work was done within the Spanish domain, where prompts and responses were written in Spanish.