Software Engineer (AI Data Labeling & RLHF)
Led supervised fine-tuning by developing and validating high-quality, task-specific prompt and response datasets to improve model accuracy. Collaborated with trainers, built review/approval pipelines, and designed workflow history tracking for greater labeling reliability. Executed RLHF workflows in partnership with annotators, refining reward models to align AI output with user expectations.
• Built analytics dashboards and LaTeX rendering for improved monitoring.
• Engineered CI/CD pipelines for training and evaluating AI models.
• Utilized Docker for reproducible environments.
• Implemented secure role-based authentication with Firebase Google OAuth.