AI tutor data science
Design original computational data science problems that simulate real-world analytical workflows across industries (telecom, finance, government, e-commerce, healthcare). Create problems requiring Python programming to solve (using pandas, numpy, scipy, sklearn, statsmodels, matplotlib, seaborn). Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). Develop problems requiring non-trivial reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insight extraction. Create deterministic problems with reproducible answers: avoid stochastic elements or require fixed random seeds for exact reproducibility. Base problems on real business challenges: customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency. Design end-to-end problems spanning the complete data science pipeline (data ingestion → cleaning → EDA → modeling → validatio