Abdulsalam Umar - Lead AI Data Annotation Engineer, GemmaCare Project

Key Skills

Software

Label Studio

Labelbox

LightTag

Top Subject Matter

Mental Health Conversational AI

ML Pipeline Benchmarking and Data Quality

Top Data Types

Text

Top Task Types

Classification

Freelancer Overview

Lead AI Data Annotation Engineer, GemmaCare Project. Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Engineering, Ahmadu Bello University (2021). AI-training focus includes data types such as Text, Image and labeling workflows including Classification, Evaluation, and Rating.

IntermediateEnglish

Labeling Experience

Data Annotation & Evaluation Specialist, ML Pipeline Suite

Text

Designed and executed dataset validation and model evaluation protocols directly related to RLHF annotation processes. Built automated pipelines to detect and flag labeling inconsistencies, duplicate samples, and class imbalances in annotated corpora. Maintained detailed logs and developed structured evaluation rubrics analogous to preference annotation and RLHF scoring. • Applied pattern recognition to target annotation errors for correction workflows. • Standardized and cleaned JSON/CSV datasets prior to ML model preprocessing. • Documented label distributions and annotation decisions for reproducibility. • Executed solo ML benchmarking on 5,000+ annotated samples.

2025 - 2025

Lead AI Data Annotation Engineer, GemmaCare Project

TextClassification

Led the design and annotation of multi-turn dialogue datasets for a mental health AI chatbot focused on intent classification and slot filling. Applied rigorous sentiment labeling and emotional tone tagging to over 3,000 user utterances, ensuring high-quality corpora for fine-tuning conversational AI models. Defined comprehensive annotation guidelines and enforced inter-rater consistency in labeling workflows. • Created structured datasets in JSON/CSV formats for ML training frameworks. • Developed guidelines for dialogue state and response quality labeling. • Integrated and evaluated Google Gemini AI with manual quality review. • Improved annotation consistency and error detection through NLP preprocessing pipelines.

2025 - 2025

Education

A

Ahmadu Bello University

Bachelor of Engineering, Computer Engineering

Bachelor of Engineering

2021

Work History

G

GemmaCare

Python Developer

Zaria

2023 - Present

L

LifeGate

Backend Developer

Zaria

2023 - 2023