AI Coding Specialist / Technical Evaluator
In this role, I evaluated and ranked code generated by Large Language Models (LLMs) for correctness, following precise logic and style guidelines. I contributed high-quality 'Gold Standard' code solutions to train models in Python, SQL, and JavaScript. Using RLHF, I identified and addressed model hallucinations, syntax errors, and logic bottlenecks. • Evaluated code output from LLMs for functional accuracy. • Delivered Gold Standard coding examples to enhance model reasoning. • Applied RLHF methodologies to improve AI model integrity. • Worked closely with engineers to refine prompt engineering strategies.