Legal AI Evaluation Specialist
Reviewed outputs from large language models for legal accuracy and reasoning. Evaluated prompt responses to identify inaccuracies, hallucinations, and completeness of legal language model outputs. Provided structured feedback on AI-generated legal reasoning. • Regularly assessed legal LLM model outputs for regulatory law scenarios. • Identified factual and logical discrepancies in AI legal responses. • Applied RLHF evaluation workflows in the legal domain. • Contributed expert-verified ratings and feedback for model improvement.