Expert RLHF for Advanced Algorithms and Python Systems
I provide expert-level RLHF and evaluation for LLMs, focusing on Python, C++, and complex data structures. My work involves ranking model responses for algorithmic correctness, optimizing code for time and space complexity, and checking code safety. In particular, I identify hallucinated logic and verify that code handles adversarial edge cases such as integer overflows and null inputs.
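As an illustration of the kind of edge-case check described above, here is a minimal Python sketch. All names in it (`checked_int32`, `midpoint_naive`, `midpoint_safe`, `max_naive`, `failing_cases`) are hypothetical and written for this example only. Because Python integers do not overflow, the 32-bit limit is simulated with an explicit range check, on the assumption that the response under review targets a language such as C++.

```python
INT32_MIN = -2**31
INT32_MAX = 2**31 - 1


def checked_int32(value):
    """Simulate a signed 32-bit integer: raise if the value is out of range."""
    if not INT32_MIN <= value <= INT32_MAX:
        raise OverflowError(f"{value} does not fit in int32")
    return value


def midpoint_naive(lo, hi):
    """Midpoint as (lo + hi) // 2; the intermediate sum may overflow."""
    return checked_int32(lo + hi) // 2


def midpoint_safe(lo, hi):
    """Midpoint as lo + (hi - lo) // 2; safe when 0 <= lo <= hi."""
    return checked_int32(lo + checked_int32(hi - lo) // 2)


def max_naive(values):
    """Largest element, with no guard for None or empty input."""
    return max(values)


def failing_cases(candidate, cases):
    """Run candidate on each (label, args) pair and list the cases that raise."""
    failures = []
    for label, args in cases:
        try:
            candidate(*args)
        except Exception as exc:
            failures.append(f"{label}: {type(exc).__name__}")
    return failures


# Hypothetical adversarial inputs chosen for this sketch.
midpoint_cases = [
    ("small values", (1, 3)),
    ("near INT32_MAX", (INT32_MAX - 1, INT32_MAX)),
]
max_cases = [
    ("normal list", ([3, 1, 2],)),
    ("None input", (None,)),
    ("empty list", ([],)),
]

print(failing_cases(midpoint_naive, midpoint_cases))
print(failing_cases(midpoint_safe, midpoint_cases))
print(failing_cases(max_naive, max_cases))
```

In a ranking task, a response that behaves like `midpoint_safe` would be preferred over one that behaves like `midpoint_naive`, even though both return the same result on small inputs.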