Reinforcement Learning from Human Feedback
The project focused on improving AI performance through Reinforcement Learning from Human Feedback (RLHF) in software engineering tasks. I evaluated and rated AI-generated code for correctness, efficiency, readability, and best practices, reviewing code completions, debugging solutions, algorithms, and technical explanations across multiple languages. Quality was ensured through structured rating scales, attention to accuracy, functionality, security, and coding standards, with consistent evaluations guided by clear rubrics and reliability checks.