AI Model Evaluator & Prompt Rater | Outlier AI
I evaluate LLM and chatbot responses for factuality, coherence, safety, and clarity in large-scale language model projects. I provide detailed feedback to improve AI outputs and reduce model errors across multiple iterations. My work supports the creation of scalable, high-quality datasets for advanced language model training.

• Labeled and rated thousands of text samples across diverse evaluation tasks.
• Assessed accuracy, hallucinations, relevance, and other key output parameters.
• Provided in-depth prompt/output analysis to support model refinement.
• Consistently achieved top evaluator performance scores in ongoing projects.