Python Code Generation & RLHF for SaaS AI Agents
Led the evaluation and refinement of AI-generated Python and JavaScript code for a video-editing and website-building SaaS (Vitra Frame/Soul Code). Performed expert-level RLHF by reviewing AI-generated pull requests (PRs), identifying logic errors, and rewriting inefficient code to create "Gold Standard" training samples. Conducted red teaming to identify security vulnerabilities and hallucinations in the model's output. Developed complex prompts to test the AI's ability to handle multi-step reasoning tasks in a production environment.