Maureen Maina

AI and Data Science Intern (OpenAI)

Nairobi, Kenya
$25.00/hr, Expert, Other, Remotasks, Roboflow

Key Skills

Software

Other
Remotasks
Roboflow
Scale AI
Snorkel AI
Toloka
Telus
Surge AI
SuperAnnotate
Mindrift
Mercor
Micro1
Mighty AI
Anno-Mage

Top Subject Matter

Natural Language Processing (NLP)
Computer Vision
AI Image Recognition & NLP (Academic Research)

Top Data Types

Text
Image
Computer Code Programming

Top Task Types

Text Generation
Classification
Bounding Box
Polygon
Segmentation
Entity (NER) Classification
Point Key Point
Polyline
Cuboid
Object Detection
Question Answering
Text Summarization
RLHF
Fine Tuning
Red Teaming
Transcription
Evaluation Rating
Computer Programming Coding

Freelancer Overview

AI and Data Science Intern (OpenAI) with 6+ years of professional experience across complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Education includes a Master of Science (2020) and a Bachelor of Science (2018), both from Harvard University. AI-training focus covers text and image data, with labeling workflows including text generation and classification.

English (Expert)

Labeling Experience

Open Source Tool Developer (Global Community)

Other, Text, Text Generation
Developed open-source tools to automate data annotation for machine learning practitioners worldwide. Designed workflows for large-scale annotation of text data to support AI model training. Supported the AI community by providing resources to streamline labeling and improve model outcomes.
• Created reusable annotation scripts/tools
• Facilitated broad access to AI labeling resources
• Enhanced the efficiency of NLP dataset creation
• Championed best practices for annotation

2021 - Present

AI and Data Science Intern (OpenAI)

Text, Text Generation
Implemented annotation frameworks to optimize model training for multilingual datasets. Designed and maintained data pipelines for text labeling in multiple languages, ensuring annotation consistency and quality. Contributed to projects requiring both manual and automated data annotation for advanced language models.
• Developed annotation tools for NLP tasks
• Managed multilingual data labeling workflows
• Collaborated on model fine-tuning strategies
• Ensured data quality and accuracy

2020 - 2020

Senior Data Science Consultant (Global Tech Solutions)

Image, Classification
Developed automated annotation pipelines to label images for machine learning model training. Reduced human error and improved annotation throughput by integrating automation tools. Supervised labeling processes to ensure training data met accuracy standards.
• Automated high-volume image labeling
• Monitored classification quality for datasets
• Supported model training for computer vision tasks
• Optimized annotation processes for efficiency

2018 - 2019

Lead Undergraduate Researcher (Harvard University)

Text, Classification
Led undergraduate research projects that involved annotating text data for Natural Language Processing and image data for computer vision tasks. Developed datasets for AI image recognition and language processing applications. Ensured accurate entity classification and data validation steps throughout the process.
• Managed annotation team for academic research
• Coordinated data quality control for datasets
• Created and validated labeled data for NLP
• Supervised annotation for image classification

2015 - 2017

Education

Harvard University

Master of Science, Computer Science

2018 - 2020

Harvard University

Bachelor of Science, Computer Science

2014 - 2018

Work History

Harvard University

Lead Data Scientist and AI Research Specialist

Cambridge
2018 - 2021

Global Tech Solutions

Senior Data Science Consultant

Nairobi
2016 - 2018