AI Trainer
Spearheaded the evaluation and optimization of Large Language Models (LLMs) via Reinforcement Learning from Human Feedback (RLHF), engineering high-complexity Chain-of-Thought (CoT) prompts to train models on advanced logic and reasoning paths. Rigorously validated State-of-the-Art (SOTA) solutions across multiple programming languages (Python, SQL, C++), performing deep-dive debugging and technical analysis to ensure AI-generated code is production-ready, secure, and efficient. Developed and solved complex coding challenges in Math Coding, Data Analysis, and 2D/3D Animation environments, establishing the "Ground Truth" for specialized technical queries that require high-precision engineering standards. Applied software engineering best practices (DevOps, CI/CD logic) to evaluate AI performance in deployment scenarios, designing granular grading rubrics to measure code quality, correctness, and style against strict industry benchmarks.