AI & Medical Content Contributor
Reviewed and labeled clinical text datasets to create high-quality training data for language models. Ensured domain-appropriate annotations and consistent taxonomy while evaluating LLM outputs for medical accuracy, clarity, and safety. Designed and iterated prompts to improve relevance and reliability of clinical model outputs. • Compared model responses to authoritative evidence and flagged errors for remediation. • Implemented quality-evaluation workflows using Excel, Notion, and Google Workspace. • Collaborated with reviewers to refine annotation guidelines for consistency. • Tracked review findings and communicated corrections to the training team.