AI Training Specialist & Data Labeler | Revelo (Remote)
I performed high-precision data labeling and annotation to improve large language model (LLM) performance. My responsibilities included supplying the rankings and ratings used in reinforcement learning from human feedback (RLHF) to align model outputs with safety and accuracy standards. I also evaluated and rated large volumes of AI-generated code and prose to support model improvement.
• Annotated and ranked AI-generated text and code responses.
• Applied RLHF protocols to improve safety and technical accuracy.
• Participated in iterative model evaluation cycles.
• Worked remotely using proprietary and standard annotation tools.