AI Trainer, Mercor, Welocalize
In my role as an AI Trainer for Mercor and Welocalize in 2026, I assisted in refining conversational AI through Reinforcement Learning from Human Feedback (RLHF). Tasks included analyzing dialogue, ranking AI responses, and annotating data to optimize LLM behavior. The position emphasized linguistic precision, cultural sensitivity, and effective data annotation practices. • Conducted detailed RLHF data annotation on text-based AI outputs • Ranked and scored multiple response options per prompt • Annotated sensitive topics with accuracy and discretion • Leveraged Mercor and Welocalize’s platforms for collaborative remote work