Jonathan Kang

Master of Health Informatics Student

Ann Arbor, USA
$20.00/hr
Entry Level
Don't disclose

Key Skills

Software

Don't disclose

Top Subject Matter

No subject matter listed

Top Data Types

Audio
Image
Text
Video

Top Task Types

Audio Recording
Bounding Box
Emotion Recognition
Evaluation Rating
Object Detection
Prompt Response Writing (SFT)
Transcription

Freelancer Overview

I have a strong background in data analysis and health informatics, with hands-on experience in data cleaning, aggregation, and transformation across large, complex datasets. My academic projects have focused on preparing and normalizing data from sources like the CDC, EPA, and Census Bureau, converting qualitative information into quantitative formats suitable for statistical modeling and visualization. I am skilled in Python (Pandas, NumPy, scikit-learn), SQL, and Excel, and have practical experience managing and ensuring the integrity of electronic health records (EHR) using systems such as NextGen and Practice Fusion. My work has required meticulous attention to detail, data quality, and compliance, making me well-suited for data labeling, annotation, and AI training data roles, especially in medical and public health domains.

Entry Level
Korean
English

Labeling Experience

Project Hedgehog Multimodal AI Trainer (Handshake AI)

Don't disclose
Audio
Audio Recording
Transcription
Project Hedgehog is a large-scale multimodal alignment initiative focused on training frontier Large Language Models to interpret and generate content across text, audio, and video domains. As a specialized trainer, I performed high-precision data labeling tasks including audio data synthesis, image bounding box annotation, and video-to-text prompt engineering to improve the model’s spatial and auditory awareness. I was responsible for developing and iteratively editing complex prompts to test the model's logical reasoning, while adhering to rigorous quality benchmarks and technical rubrics to ensure the output was factual, safe, and instruction-compliant.

2025 - 2025

Education

University of Michigan School of Information

Master of Health Informatics, Health Informatics

2024 - 2025

University of Michigan

Bachelor of Science, Biomolecular Science

2018 - 2022

Work History

Specialists In Rehabilitation Medicine

Medical Assistant

Rochester Hills
2022 - 2024