AI Logic & Structured Evaluation (Contract)
In this contract, I evaluated AI-generated outputs containing Python logic, pseudocode, and structured reasoning. My responsibilities included reviewing code for correctness and detecting logical flaws, faulty assumptions, and edge cases. I leveraged standardized rubrics to ensure evaluation consistency and reliability. • Reviewed Python and SQL code for coherence and validity. • Assessed output quality based on predefined rubrics. • Checked schema compliance for JSON and YAML formats. • Applied critical reasoning to flag ambiguities and inconsistencies.