Exceptional Software Engineer
This project involved designing structured prompts and multi-turn conversational scenarios to evaluate large language model (LLM) performance across reasoning, instruction-following, and analytical capabilities. The scope included generating high-quality evaluation datasets, assessing model outputs, and producing structured feedback reports with performance ratings.