Generative AI Specialist (Humanities)
As a Generative AI Specialist at Innodata, I evaluated and refined prompts and responses for large language models. I conducted rubric-based and pairwise output evaluations, focusing on accuracy, tone, style, and safety. I developed and enforced annotation guidelines to ensure consistency and applied adversarial and multilingual evaluation techniques. • Wrote, edited, and improved prompts and AI responses for LLM training and evaluation • Performed preference ranking and quality reviews according to evolving guidelines • Conducted adversarial testing to identify errors, hallucinations, bias, and policy risk • Authored annotation standards and gold labels across multiple languages (EN/ES/NL)