AI Data Ops LLM Software Engineer - Google
Evaluated and helped develop large language model (LLM) evaluation workflows as part of the Gemini AI platform. Participated directly in coding and security review of LLM outputs, driving improvements in AI robustness. Developed and applied AI evaluation best practices in collaboration with cross-functional teams.
• Assessed LLM-generated text using standardized evaluation protocols.
• Provided feedback and ratings to improve AI-generated outputs.
• Documented findings on model performance and edge cases.
• Used proprietary tools for LLM evaluation and feedback cycles.