Data Annotator
As a Data Annotator at Hugo Technologies, I analyze and label large-scale text datasets to support the training and refinement of Large Language Models. I perform Reinforcement Learning from Human Feedback (RLHF) to directly improve model reasoning and conversational flow. My responsibilities include evaluating model outputs and reporting data edge cases to maintain high training standards. • Labeled diverse text data to enhance natural language understanding and generation. • Conducted RLHF-driven evaluations for accuracy, safety, and project compliance. • Identified and reported ambiguities to ensure data integrity and model performance. • Contributed to the overall quality and representativeness of the dataset.