For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
J

Jeremy Gross

Data Scientist (Data Labeling for ML Training)

USA flag
N/A, Usa
$20.00/hrExpertRemotasksMicro1Mercor

Key Skills

Software

RemotasksRemotasks
Micro1
MercorMercor

Top Subject Matter

E-commerce Product Data & Taxonomy
Purchasing Behavior Analytics
Customer Feedback Analysis/NLP

Top Data Types

TextText
AudioAudio
DocumentDocument

Top Task Types

Classification
Prompt Response Writing SFT
Text Generation
Question Answering
Text Summarization
Transcription
Data Collection

Freelancer Overview

Data Scientist (Data Labeling for ML Training). Brings 11+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Remotasks, Internal, and Proprietary Tooling. Education includes Bachelor of Science, University of California, San Diego (2016). AI-training focus includes data types such as Text and labeling workflows including Classification.

ExpertEnglish

Labeling Experience

Remotasks

Data Scientist (Data Labeling for ML Training)

RemotasksTextClassification
As a Data Scientist at Remotask, I developed and deployed machine learning models that required annotated text data for training and testing. My responsibilities included preparing and classifying datasets for supervised learning applications, ensuring that the data was properly labeled for customer purchasing prediction models. I utilized text-based data drawn from e-commerce transactions and customer records, guiding best practices for data annotation and validation. • Labeled and classified text data for use in predictive modeling. • Collaborated with cross-functional teams to define labeling standards and protocols. • Ensured data quality and accuracy by performing QA on annotated datasets. • Utilized Remotasks platform for annotation workflows and task management.

As a Data Scientist at Remotask, I developed and deployed machine learning models that required annotated text data for training and testing. My responsibilities included preparing and classifying datasets for supervised learning applications, ensuring that the data was properly labeled for customer purchasing prediction models. I utilized text-based data drawn from e-commerce transactions and customer records, guiding best practices for data annotation and validation. • Labeled and classified text data for use in predictive modeling. • Collaborated with cross-functional teams to define labeling standards and protocols. • Ensured data quality and accuracy by performing QA on annotated datasets. • Utilized Remotasks platform for annotation workflows and task management.

2020 - 2025

Data Science Intern (NLP Data Labeling)

TextClassification
During my internship at Dell Technologies, I assisted in the development of a natural language processing model by contributing to the labeling and categorization of customer feedback data. My work involved classifying customer comments according to sentiment and intent, directly supporting model accuracy improvements and actionable insights. I coordinated data collection, quality checks, and labeling documentation throughout the NLP pipeline. • Categorized and labeled customer feedback for sentiment analysis. • Maintained accurate records of annotation activities and sample distributions. • Collaborated with data scientists to align on label definitions and criteria. • Used internal tooling for text data annotation and quality review.

During my internship at Dell Technologies, I assisted in the development of a natural language processing model by contributing to the labeling and categorization of customer feedback data. My work involved classifying customer comments according to sentiment and intent, directly supporting model accuracy improvements and actionable insights. I coordinated data collection, quality checks, and labeling documentation throughout the NLP pipeline. • Categorized and labeled customer feedback for sentiment analysis. • Maintained accurate records of annotation activities and sample distributions. • Collaborated with data scientists to align on label definitions and criteria. • Used internal tooling for text data annotation and quality review.

2016 - 2017

Education

U

University of California, San Diego

Bachelor of Science, Mathematics and Computer Science

Bachelor of Science
2012 - 2016

Work History

R

Remotask

Data Scientist

N/A
2020 - Present
I

IBM

Junior Data Scientist

Boston
2017 - 2019