Professional Domain Expert – AI Model Evaluation and Labeling
As a Professional Domain Expert at Outlier AI, I evaluated large language models in the medical and STEM domains. My responsibilities included prompt engineering, adversarial prompt design, and detailed assessment of AI-generated text. I provided qualitative feedback to improve model accuracy and training quality. • Developed prompts and adversarial scenarios for medical and STEM-based AI outputs • Evaluated AI-generated text for factual accuracy, completeness, and tone • Used rubric-based frameworks to assess response quality • Provided structured feedback to data science and engineering teams