AI Evaluation Specialist / LLM Reviewer
As an AI Evaluation Specialist and LLM Reviewer, I assessed AI-generated text outputs for accuracy, safety, and reasoning quality in safety-critical medical contexts. I designed domain-specific tasks and simulated expert-level workflows to train AI models on complex prompt completions. My work involved rubric-based evaluation, comparative response analysis, and quality assurance to ensure high-standard model outputs. • Evaluated and compared AI-generated text for hallucination detection and optimal output selection. • Developed and applied structured evaluation rubrics for consistency and reproducibility. • Conducted error analysis and designed correction strategies for model improvement. • Served as an AI Trainer and Final Reviewer, adhering to strict guidelines and workflow protocols.