AI Trainer & Python Developer (RLHF Data Labeling)
Served as an AI Trainer performing reinforcement learning from human feedback (RLHF) on Python code. Reviewed, debugged, and evaluated AI-generated code solutions for alignment with gold-standard programming practices. Collaborated on workflow documentation for AI training datasets.
• Processed and rated over 500 complex algorithmic problems for training large language models.
• Provided granular feedback to improve AI systems' coding capabilities and logic.
• Rewrote and corrected AI-generated solutions in Python and Docker environments.
• Applied a deep understanding of data structures and algorithms to produce high-quality data labels.