For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
M

Michael Nkemdilim

AI Engineer & Data Scientist – LLM Output Evaluation and Human-in-the-loop Feedback

USA flagAtlanta, Usa
$25.00/hrExpertAws SagemakerLabelboxCVAT

Key Skills

Software

AWS SageMakerAWS SageMaker
LabelboxLabelbox
CVATCVAT
Label StudioLabel Studio
SuperAnnotateSuperAnnotate
EncordEncord

Top Subject Matter

Large Language Models
AI Systems
Prompt Engineering

Top Data Types

TextText
ImageImage
VideoVideo

Top Task Types

Data CollectionData Collection

Freelancer Overview

AI Engineer & Data Scientist – LLM Output Evaluation and Human-in-the-loop Feedback. Brings 9+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Science, Georgia Institute of Technology (2021) and Bachelor of Science, Georgia State University (2017). AI-training focus includes data types such as Text and labeling workflows including Evaluation, Rating, and Data Collection.

ExpertEnglish

Labeling Experience

AI Engineer & Data Scientist – LLM Output Evaluation and Human-in-the-loop Feedback

Text
I evaluated and ranked the outputs from large language models (LLMs) such as GPT-4o and Llama for reasoning, factual accuracy, and safety criteria. I designed and implemented large-scale LLM evaluation pipelines, integrating structured workflows and human-in-the-loop feedback for iterative model improvement. This process contributed to enhanced model reliability, reduced hallucination rates, and improved benchmarking efficiency. • Implemented evaluation frameworks optimizing response quality. • Collaborated on RLHF-style feedback systems to refine model alignment. • Leveraged prompt engineering to target error reduction in outputs. • Used benchmarking metrics to increase efficiency across multiple domains.

I evaluated and ranked the outputs from large language models (LLMs) such as GPT-4o and Llama for reasoning, factual accuracy, and safety criteria. I designed and implemented large-scale LLM evaluation pipelines, integrating structured workflows and human-in-the-loop feedback for iterative model improvement. This process contributed to enhanced model reliability, reduced hallucination rates, and improved benchmarking efficiency. • Implemented evaluation frameworks optimizing response quality. • Collaborated on RLHF-style feedback systems to refine model alignment. • Leveraged prompt engineering to target error reduction in outputs. • Used benchmarking metrics to increase efficiency across multiple domains.

2022 - 2025

Machine Learning Specialist – Data Annotation and Validation Workflow

TextData Collection
I constructed and managed data preprocessing and annotation pipelines for natural language processing (NLP) and computer vision research. I supported high-stakes decision workflows requiring human validation loops, focusing on data preparation and iterative experimentation for model testing. My contributions led to improvements in dataset quality and reductions in preparation time for downstream modeling tasks. • Built pipelines for efficient dataset labeling and annotation. • Conducted iterative evaluation of model predictions and validation tasks. • Integrated human-in-the-loop processes for system verification. • Optimized dataset readiness for machine learning experimentation.

I constructed and managed data preprocessing and annotation pipelines for natural language processing (NLP) and computer vision research. I supported high-stakes decision workflows requiring human validation loops, focusing on data preparation and iterative experimentation for model testing. My contributions led to improvements in dataset quality and reductions in preparation time for downstream modeling tasks. • Built pipelines for efficient dataset labeling and annotation. • Conducted iterative evaluation of model predictions and validation tasks. • Integrated human-in-the-loop processes for system verification. • Optimized dataset readiness for machine learning experimentation.

2019 - 2021

Education

G

Georgia Institute of Technology

Master of Science, Computer Science

Master of Science
2021 - 2021
G

Georgia State University

Bachelor of Science, Computer Science

Bachelor of Science
2017 - 2017

Work History

H

HatchWorks AI

AI Engineer & Data Scientist

Atlanta
2022 - 2025
G

Georgia Tech Research Institute

Machine Learning Specialist

Atlanta
2019 - 2021