Coding AI Trainer
I worked on coding projects applying Reinforcement Learning from Human Feedback (RLHF) to optimize AI systems. I utilized human feedback to fine-tune algorithms, improving performance and aligning AI decisions with business needs and human values