Sergio Nuñez

Software Engineer for Training AI Data (RLHF Annotator)

Remote, Peru
$25.00/hr, Intermediate, Other, Scale AI

Key Skills

Software

Other
Scale AI

Top Subject Matter

AI model training
RLHF Domain Expertise
Python Domain Expertise

Top Data Types

Text
Audio
Computer Code Programming

Top Task Types

RLHF
Data Collection
Question Answering

Freelancer Overview

Software Engineer for Training AI Data (RLHF Annotator) with 1+ years of professional experience in AI data annotation, research, and quality-focused work. Core strengths include data labeling and Python. Education: Bachelor of Science in Computer Science, Peruvian University of Applied Sciences (2024), and High School Diploma, Anglo-American Prescott School (2018). AI-training focus covers text data and labeling workflows including RLHF and data collection.

Intermediate, English, Spanish

Labeling Experience

Software Engineer for Training AI Data (RLHF Annotator)

Other, Text, RLHF
As a Software Engineer for Training AI Data at Outlier, I participated in RLHF (Reinforcement Learning from Human Feedback) projects specifically related to data science, Python programming, and Spanish language tasks. My work involved providing human feedback, evaluating model responses, and fine-tuning large language models through RLHF methodologies. I contributed to generalist RLHF projects and ensured the quality of AI responses aligned with expected standards.
• Evaluated and rated AI model outputs in Spanish and English.
• Provided feedback for Python programming-focused RLHF tasks.
• Participated in human-in-the-loop AI training for data science projects.
• Collaborated remotely and maintained high annotation consistency.

2024 - Present

Undergraduate Thesis Author (AI model for Hate Speech Detection)

Text, Data Collection
As the author of the undergraduate thesis 'NoHateS: An AI Model for the Automatic Detection of Hate Speech in Social Interaction Platforms', I performed data collection, cleaning, merging, and augmentation for hate speech detection in Spanish text. I fine-tuned the BETO (Spanish BERT) model and developed multiple neural network architectures for the task. I designed, prepared, and labeled large datasets to train and test AI models for detecting hate speech.
• Collected and curated publicly available data for training AI models.
• Annotated and cleaned Spanish-language social media data for hate speech.
• Applied data augmentation using NLP techniques (NLPaug).
• Compared annotation and model results across different architectures.

2022 - 2024

Author of Paper (Data Labeling for NLP AI)

Text, Data Collection
As the author of the paper 'NoHateS: A Transformers-based Approach for Real-Time Hate Speech Detection in Spanish', I managed data collection, annotation, and augmentation for transformers-based hate speech detection. I compared models on labeled datasets and used data augmentation methods to enhance model robustness. I performed thorough dataset preparation and quality assurance for publication.
• Curated and annotated Spanish hate speech data from social platforms.
• Utilized data augmentation to expand training material diversity.
• Compared efficacy of CNN and LSTM architectures for NLP tasks.
• Prepared open-source datasets and code for community use.

2023 - 2023

Author of Paper (AI Data Collection & Annotation)

Text, Data Collection
As author of the paper 'NoHateS: A Hate Speech Detection System in Spanish Using Transformers-based models', I led continued data collection, cleaning, and augmentation for improving AI hate speech detection. I developed and compared several transformer-based models on annotated Spanish social interaction data. My work emphasized high-quality data preprocessing and evaluation to optimize model performance.
• Collected and annotated additional hate speech datasets in Spanish.
• Enhanced datasets with targeted cleaning and NLP-based augmentation.
• Optimized and evaluated model architectures using labeled data.
• Provided dataset curation and documentation for reproducibility.

2023 - 2023

Education

Peruvian University of Applied Sciences

Bachelor of Science, Computer Science

2019 - 2024

Anglo-American Prescott School

High School Diploma, General Primary and Secondary Education

2006 - 2018

Work History

Tracking And Tracing Solutions S.A.C.

Frontend Developer

Remote
2023 - 2023