Bilingual Prompt-Based Data Labeling for AI Model Training
I worked on a large-scale data labeling project focused on generating and annotating English and Hindi prompts and responses for training and evaluating large language models. Tasks included creating diverse prompt-response pairs, classifying intent and sentiment, and translating content between English and Hindi to ensure high-quality bilingual datasets. I also participated in prompt evaluation and red teaming for AI safety, adhering to strict quality guidelines and accuracy benchmarks. The project involved labeling over 20,000 text samples and required both linguistic expertise and programming skills for data validation and automation.