AI Evaluator & Medical Annotation Specialist
As an AI Evaluator & Medical Annotation Specialist, I assessed LLM responses for clinical accuracy and safety in dental and healthcare domains. I designed and applied structured rubrics to evaluate AI output based on real dental radiographs and intraoral images. My work focused on identifying errors in diagnostic reasoning and promoting high-quality, accurate AI-generated medical responses. • Evaluated and compared AI-generated responses using weighted clinical rubrics. • Crafted high-complexity, image-based prompts for model testing. • Flagged errors like misdiagnosis or inappropriate treatments in model outputs. • Ensured ongoing quality assurance in a production environment.