QA Automation Data Lead (LLM Data QA and RLHF)
I led the QA Automation Data team, focusing on preparing and validating multilingual text datasets essential for LLM evaluation and training. My responsibilities included establishing LLM-aligned data pipelines, applying RLHF and SFT techniques, and enforcing high-quality standards across diverse language corpora. I coordinated documentation and process guidelines using LaTeX, ensuring structured and reliable data flow. • Supervised dataset creation and validation for RLHF and SFT tasks • Implemented and refined LLM 'Golden Set' validation protocols • Guided technical annotation in English, Telugu, Hindi, and Tamil • Oversaw use of internal/proprietary automation and documentation tools