For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C

Carol Nyarotso

Data Systems Specialist - Data Labeling & QA

USA flagAtlanta, Usa
$25.00/hrIntermediateClickworkerAppenRemotasks

Key Skills

Software

ClickworkerClickworker
AppenAppen
RemotasksRemotasks
Other

Top Subject Matter

Healthcare and Analytics Data Quality
Healthcare Data Annotation
Public Health Data Annotation

Top Data Types

DocumentDocument
TextText

Top Task Types

ClassificationClassification
Data CollectionData Collection
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
Text GenerationText Generation
Question AnsweringQuestion Answering
Text SummarizationText Summarization
RLHFRLHF
Fine-tuningFine-tuning
TranscriptionTranscription
Evaluation/RatingEvaluation/Rating
Computer Programming/CodingComputer Programming/Coding

Freelancer Overview

Data Systems Specialist with 6+ years of experience supporting data labeling, quality assurance, and research-driven workflows across complex environments. Skilled in working with internal and proprietary tools to ensure high-quality data outputs, with a strong focus on accuracy, consistency, and process improvement. Brings experience in document-based data workflows, including classification and structured labeling processes, with an understanding of how high-quality data supports AI and analytical systems. Holds a Master of Public Health from Maseno University and a Bachelor of Science from Mount Kenya University, combining a strong foundation in research, data systems, and quality-focused execution.

IntermediateEnglishSwahili

Labeling Experience

MEL Systems Specialist (Data Systems & Analytics Lead) - Data Labeling & QA

DocumentClassification
Reviewed and validated large-scale datasets to ensure consistency, accuracy, and adherence to structured annotation standards for AI training workflows. Leveraged Python-based automation to streamline validation processes, reduce manual effort, and maintain high data quality across international datasets. Validated and labeled datasets using Python (Pandas) and SQL, ensuring accuracy and consistency across structured data workflows Identified anomalies, missing values, and inconsistencies in healthcare and analytics datasets, improving overall data integrity Collaborated with multi-country teams to maintain data standards and ensure alignment across distributed workflows Automated data validation processes in Python, reducing manual review time and improving efficiency

Reviewed and validated large-scale datasets to ensure consistency, accuracy, and adherence to structured annotation standards for AI training workflows. Leveraged Python-based automation to streamline validation processes, reduce manual effort, and maintain high data quality across international datasets. Validated and labeled datasets using Python (Pandas) and SQL, ensuring accuracy and consistency across structured data workflows Identified anomalies, missing values, and inconsistencies in healthcare and analytics datasets, improving overall data integrity Collaborated with multi-country teams to maintain data standards and ensure alignment across distributed workflows Automated data validation processes in Python, reducing manual review time and improving efficiency

2024 - 2026

Digital Health Specialist - Data Classification & Review

DocumentClassification
Reviewed and processed sensitive healthcare records to support accurate classification, data integrity, and quality assurance within AI-driven data workflows. Identified discrepancies in patient and claims data and supported corrective actions, strengthening the reliability of datasets used for analytics and model training. Labeled and classified electronic health record (EHR) data to support structured data workflows and machine learning readiness Maintained data consistency using Excel and internal systems for tracking, validation, and documentation Identified and resolved discrepancies in patient and claims records, improving overall data accuracy and integrity Contributed to the refinement of healthcare datasets, enhancing their usability for analytics

Reviewed and processed sensitive healthcare records to support accurate classification, data integrity, and quality assurance within AI-driven data workflows. Identified discrepancies in patient and claims data and supported corrective actions, strengthening the reliability of datasets used for analytics and model training. Labeled and classified electronic health record (EHR) data to support structured data workflows and machine learning readiness Maintained data consistency using Excel and internal systems for tracking, validation, and documentation Identified and resolved discrepancies in patient and claims records, improving overall data accuracy and integrity Contributed to the refinement of healthcare datasets, enhancing their usability for analytics

2021 - 2023

Education

M

Maseno University

Master of Public Health, Public Health

Master of Public Health
2015 - 2021
M

Mount Kenya University

Bachelor of Science, Health Records and Information Management

Bachelor of Science
2012 - 2014

Work History

E

Episcopal Relief & Development

MEL Systems Specialist (Data Systems & Analytics Lead)

Atlanta
2024 - 2026
D

Datavant

Area Lead Health Information Specialist

Tampa
2024 - 2024