AI Training Prompt Writer
During my time with Mercor, I focused on creating adversarial medical prompts designed to induce model failure in clinical reasoning and safety. I developed realistic, high complexity scenarios that tested diagnostic accuracy, treatment decisions, contraindications, and risk assessment, specifically aiming to expose hallucinations, unsafe recommendations, and flawed reasoning. The project operated at scale with structured review processes, including calibration and secondary validation to ensure prompts were clinically accurate, sufficiently challenging, and effective at differentiating model performance. This work required deep medical expertise, precision in language, and a clear understanding of how AI systems fail in healthcare contexts.