Python & AI Systems Engineer (AI Training/Evaluation)
In this role, I evaluated and validated AI-generated Python code outputs, reviewing responses for correctness, logical consistency, and execution reliability. My work focused on identifying failure patterns in AI reasoning and improving the quality of LLM outputs at scale. I utilized internal Python-based tools for code assessment and automated workflow validation. • Assessed 100+ AI-generated Python code responses weekly • Designed and improved validation workflows for API-integrated AI systems • Contributed structured feedback to boost model reliability • Operated remotely in asynchronous AI training environments