AI Behavioral Evaluation Engineer
- Evaluated AI-generated TypeScript FullStack applications for correctness, architecture, and production readiness. - Conducted structured behavioral analysis (hallucinations, scope creep, verification gaps, tool misuse). - Validated Docker-based environments, frontend/ backend integration, and test execution. - Performed comparative A/ B model benchmarking and quality scoring across interaction cycles.