AI Trainer – RLHF and Conversational AI Empathy Tuning
As an AI Trainer, I refined large language models (LLMs) using reinforcement learning from human feedback (RLHF). I applied linguistic patterns drawn from cognitive behavioral therapy (CBT) to make conversational AI more empathetic and helpful. The project focused on optimizing sentiment, accuracy, and tone in AI-generated interactions.
• Developed guidelines for assessing AI responses for alignment with human values and empathy.
• Evaluated and rated natural language outputs to improve accuracy and conversational flow.
• Provided continuous feedback and prompt engineering to iteratively improve model performance.
• Contributed subject matter expertise from psychology and wellness domains.