GDP Val
I designed complex AI evaluation tasks for a professional AI training project, writing domain-specific prompts and scoring rubrics to test the reasoning capabilities of advanced LLMs. I calibrated each task to target a specific performance range and refined it based on peer review feedback. My background in cloud architecture and enterprise IT provided the domain expertise needed to write technically accurate evaluation scenarios.