Code evaluator and data annotator
Worked on an internal project assessing the correctness, structure, and logic of model-generated code. Tasks included detecting functional errors, reviewing code snippets, validating outputs against instructions, and writing high-quality corrections or improved prompts. Also performed classification and RLHF-style evaluations to guide model behaviour. The project required precise analytical skills, careful documentation, and strong consistency in applying detailed technical guidelines.