Generative AI Specialist (Humanities)
As a Generative AI Specialist at Innodata, I wrote, edited, and refined prompts and responses to train and evaluate large language models. My responsibilities included evaluating and ranking AI-generated outputs using comparative and rubric-based methods, identifying safety, accuracy, and compliance issues. I authored and implemented detailed annotation guidelines and performed multilingual evaluation to ensure high-quality data labeling outcomes. • Adversarially tested model responses for logical errors, bias, hallucinations, and policy risks. • Applied and updated gold standard annotation protocols to maintain evaluation consistency. • Performed fact-checking and source verification during data labeling tasks. • Delivered high-volume, guideline-driven annotation and evaluation remotely.