Python and JavaScript RLHF and RLEF trainer – part-time
I worked as an AI model trainer, focusing on optimizing large language models using reinforcement learning from human feedback (RLHF) and reinforcement learning from execution feedback (RLEF). My responsibilities included designing and evaluating prompts to improve code generation, debugging, and performance tuning within both Python and JavaScript domains. The work enhanced model performance, accuracy, and contextual capabilities, supporting better AI-assisted coding experiences. • Optimized LLMs for contextual understanding with RLHF and RLEF. • Developed and evaluated prompts for code generation and debugging. • Focused on Python and JavaScript applications. • Ensured AI-generated outputs met high standards for coherence and accuracy.