LLM Response Evaluation and Ranking
Evaluated and ranked large language model (LLM) outputs for accuracy, reasoning quality, instruction adherence, and tone. Provided detailed justifications and corrective feedback to support model fine-tuning and improve performance.