Senior AI Training Consultant
As Lead Medical SME for LLM training projects at Scale AI, I developed Golden Response datasets for pediatric surgical cases. I performed Multi-Turn Dialogue Evaluation to rate AI model responses for safety, logic, and helpfulness. I regularly debugged and rewrote AI-generated Python code used for clinical data analysis. • Created factual and precise model responses for medication dosage and surgical guidance • Evaluated model outputs using HHH metrics with a structured grading rubric • Ensured dataset integrity for pediatric STEM reasoning and clinical safety • Provided expert feedback to AI teams to align models with medical standards