Data Scientist (Data Labeling for ML Training)
As a Data Scientist at Remotask, I developed and deployed machine learning models that required annotated text data for training and testing. My responsibilities included preparing and classifying datasets for supervised learning applications, ensuring that the data was properly labeled for customer purchasing prediction models. I utilized text-based data drawn from e-commerce transactions and customer records, guiding best practices for data annotation and validation. • Labeled and classified text data for use in predictive modeling. • Collaborated with cross-functional teams to define labeling standards and protocols. • Ensured data quality and accuracy by performing QA on annotated datasets. • Utilized Remotasks platform for annotation workflows and task management.