Data Scientist Generative AI Trainer
Designed and documented computational data science problems to serve as training and evaluation data for Generative AI systems. Authored complex Python-based tasks and multi-step analytical prompts, requiring advanced reasoning from AI models. Ensured outputs were deterministic and reproducible by using fixed random seeds throughout the labeling process. • Developed original data science challenges reflecting workflows in finance, healthcare, education, and e-commerce. • Programmatically verified and validated all expected solutions before submission. • Provided clear business context and thorough documentation for each prompt and answer set. • Focused on reproducibility, clarity, and ethical AI training outcomes.