Prompt Writing and Evaluation (Failure Testing)
In this project, I was responsible for crafting prompts within specific categories (e.g., chatbot, classification, Q&A, summarization, etc.). The goal was to design prompts in such a way that they would cause the AI to fail at least one of the given constraints. Once the AI responded, I compared the outputs, evaluated them against multiple criteria, and rewrote the prompt in a more effective manner. My task also involved providing detailed comments on the differences between the initial and corrected responses to ensure the AI's performance aligned with the required standards.