AI Training Data Annotation for NLP & Content Classification
Worked on multiple AI training data projects focused on text annotation for large language models. Responsibilities included labeling and classifying text data for intent, sentiment, policy compliance, and relevance. Performed Named Entity Recognition (NER), response quality evaluation, and RLHF-style ranking to improve model alignment and accuracy. The project involved annotating thousands of data points while strictly following evolving guidelines and handling complex edge cases. Quality was maintained through consistency checks, peer reviews, and adherence to high accuracy thresholds.