For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Rufus Obet

Rufus Obet

Data Labeller

USA flag
Indiana, Usa
$15.00/hrExpertCloudfactoryCVATData Annotation Tech

Key Skills

Software

CloudFactoryCloudFactory
CVATCVAT
Data Annotation TechData Annotation Tech
LabelboxLabelbox
MercorMercor
RemotasksRemotasks
Scale AIScale AI
TelusTelus

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
ImageImage
TextText
VideoVideo

Top Label Types

Bounding Box
Polygon
Point Key Point
Segmentation

Freelancer Overview

I am an applied AI engineer with a strong foundation in mathematics and over five years of experience building, evaluating, and deploying machine learning solutions. My work has focused on the creation and curation of high-quality training datasets, including synthetic and augmented data generation to address data sparsity and improve model performance. I have hands-on experience with data labeling, annotation, and evaluation using Python, pandas, NumPy, and specialized ML tools such as scikit-learn, PyTorch, and Hugging Face Transformers. In several projects, I have built retrieval-augmented generation (RAG) systems and designed robust evaluation workflows to ensure data quality and model reliability. My background also includes translating business requirements into clear data annotation guidelines and delivering workshops to both technical and non-technical audiences, ensuring that data-centric AI solutions meet real-world needs. I am passionate about creating data pipelines and evaluation frameworks that drive trustworthy AI outcomes across domains like analytics, document understanding, and knowledge retrieval.

ExpertEnglish

Labeling Experience

CVAT

Data Associate

CVATImageBounding BoxPolygon
This project involved labeling images for machine learning model training, focusing on vehicle, human, and large language model (LLM) related datasets. Tasks included annotating images using bounding boxes and point/key points to precisely identify and mark key features. The project followed quality standards to ensure accurate and consistent labeling, adhering to defined labeling guidelines. The dataset size ranged from thousands to tens of thousands of images, with regular quality checks and validation processes in place to maintain high labeling accuracy. The data labeling was performed using the CVAT tool, ensuring efficient and streamlined workflows.

This project involved labeling images for machine learning model training, focusing on vehicle, human, and large language model (LLM) related datasets. Tasks included annotating images using bounding boxes and point/key points to precisely identify and mark key features. The project followed quality standards to ensure accurate and consistent labeling, adhering to defined labeling guidelines. The dataset size ranged from thousands to tens of thousands of images, with regular quality checks and validation processes in place to maintain high labeling accuracy. The data labeling was performed using the CVAT tool, ensuring efficient and streamlined workflows.

2019 - 2025

Education

U

University of California, Berkeley

Doctor of Philosophy (PhD) in Mathematics, Mathematics

Doctor of Philosophy (PhD) in Mathematics
2022 - 2024
U

University of California, Berkeley

Doctor of Philosophy, Mathematics

Doctor of Philosophy
2020 - 2024

Work History

S

San Francisco State University

Mathematics Tutor

San Francisco
2020 - Present
I

Independent & Contract Projects

Applied AI / Machine Learning Engineer

San Francisco
2020 - Present