AI Evaluation & Analysis Sandbox — Project Hedgehog Prep
Designed a local evaluation sandbox that replicates Project Hedgehog workflows using open-source LLM tools. Developed custom rubrics and created synthetic datasets for prompt engineering, output analysis, and hallucination detection. Practiced annotation techniques and feedback writing in an environment aligned with real-world AI training protocols.
• Scored AI outputs on multiple dimensions
• Tested model reasoning, summarization, and instruction following
• Used project-aligned practice materials and guidelines
• Simulated large-scale data labeling and evaluation cycles