Content Moderator / AI Data Annotator
As a Content Moderator and AI Data Annotator at Tech Mahindra, I performed text data annotation and human feedback tasks to enhance LLM-based AI models. My work focused on reviewing, classifying, and evaluating large volumes of user-generated content for trust, safety, and machine learning model training. I contributed directly to AI model alignment with human values through RLHF and continuous policy enforcement. • Annotated and labeled text content for LLM training and evaluation • Conducted reinforcement learning from human feedback (RLHF) to improve model outputs • Evaluated and rated AI-generated responses for accuracy and safety • Used proprietary/internal tooling (Datacompute, Raterhub, IDV, Rating.ewoq) and maintained >95% accuracy