For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Qi Yu

Qi Yu

AI Training Data Specialist | LLM Prompt Engineering | Multilingual Content & Annotation

HONG_KONG flag
尖沙咀, Hong Kong
$15.00/hrEntry LevelOther

Key Skills

Software

Other

Top Subject Matter

No subject matter listed

Top Data Types

DocumentDocument
TextText

Top Label Types

Classification
Data Collection
Evaluation Rating

Freelancer Overview

I am an AI product and data operations professional with over 10 years of experience across internet platforms, telecom, gaming, e-commerce, and content domains. My expertise centers on data labeling, annotation, and AI training data workflows, including prompt engineering, output QA, and human-in-the-loop validation for large language models such as ChatGPT, Claude, Gemini, and Grok. I have designed prompt pipelines and converted unstructured text into structured, high-quality datasets for tasks like sentiment classification, topic extraction, and intent recognition, notably in projects like my YouTube Comment Analysis Tool. I am skilled in using Excel for data validation and KPI tracking, and I have a strong ability to bridge human language and structured data requirements for AI systems. My hands-on approach and practical project experience enable me to ensure data quality, consistency, and relevance in diverse AI and data-driven environments.

Entry LevelEnglishChinese Mandarin

Labeling Experience

LLM-Based YouTube Comment Labeling and Evaluation

OtherTextClassification
Built and operated an AI-powered system to collect and label YouTube user comments for LLM training and evaluation. Raw comments were filtered, cleaned, and categorized into sentiment, intent, and topic labels using prompt-based workflows. I performed human-in-the-loop review to verify label accuracy, detect hallucinations, bias, and low-quality outputs, and corrected model errors to create high-quality training data. The project involved thousands of real user comments and required consistency checks, ambiguity resolution, and quality scoring of AI-generated annotations. This dataset was used to evaluate and improve prompt pipelines and model alignment with real human language.

Built and operated an AI-powered system to collect and label YouTube user comments for LLM training and evaluation. Raw comments were filtered, cleaned, and categorized into sentiment, intent, and topic labels using prompt-based workflows. I performed human-in-the-loop review to verify label accuracy, detect hallucinations, bias, and low-quality outputs, and corrected model errors to create high-quality training data. The project involved thousands of real user comments and required consistency checks, ambiguity resolution, and quality scoring of AI-generated annotations. This dataset was used to evaluate and improve prompt pipelines and model alignment with real human language.

2025 - 2025

Education

S

Shaoyang University

Bachelor of Engineering, Landscape Architecture

Bachelor of Engineering
2010 - 2014

Work History

S

Short Video Platforms

Content and Growth Operator

SHENZHEN
2025 - 2025
S

Shopee

E-commerce Operator

SHENZHEN
2024 - 2025