AI QA Specialist – AI Output Validation & LLM Testing
Conducted AI output validation and LLM testing across a B2B SaaS platform, applying prompt engineering to assess model reliability. Used prompt refinement and output benchmarking to systematically evaluate and improve AI-generated responses. Integrated multiple AI models into product QA workflows, focusing on maintaining high accuracy and reducing error rates.
• Validated LLM and AI outputs for accuracy, coherence, and alignment with requirements
• Refined prompts and performed model benchmarking for continuous improvement
• Leveraged ChatGPT, Gemini, and Copilot for daily AI testing within a CI/CD environment
• Maintained ~99% output validation accuracy during a 4-year engagement