Data Labeling Specialist (RLHF, QA, Multimodal Labeling)
As a data labeling specialist, performed RLHF evaluation and annotated model outputs for training AI systems. Reviewed and scored AI-generated text responses, ensuring adherence to detailed rubrics in technical, code, and general domains. Specialized in instruction-following, helpfulness/harmlessness scoring, and identifying edge cases in STEM-heavy datasets. • Consistently applied labeling guidelines across multiple annotation platforms. • Conducted inter-annotator agreement and quality assurance audits for annotation consistency. • Labeled and evaluated code snippets, bug identification tasks, and technical documentation. • Utilized platforms such as Scale AI, Surge AI, Labelbox, and custom annotation UIs for large-scale labeling workflows.