Yikuuu Zhang

Model Evaluation and Labeling Contributor

Beijing, China
$80.00/hr | Expert | Label Studio, CVAT, Labelbox

Key Skills

Software

Label Studio
CVAT
Labelbox
AWS SageMaker
Encord
SuperAnnotate
Scale AI

Top Subject Matter

Natural Language Processing / Model Evaluation

Top Data Types

Text
Video
Image

Top Task Types

Text Generation
Question Answering
Computer Programming/Coding
Prompt + Response Writing (SFT)
Fine-tuning

Freelancer Overview

Model evaluation and labeling contributor with 4+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Holds a Master of Science from Tsinghua University (2023) and a Bachelor of Science from Bachelor's Institution (2020). AI-training work focuses on text data and on evaluation and rating workflows.

English: Expert

Labeling Experience

Model Evaluation and Labeling Contributor

Text
Contributed to building and evaluating datasets for model assessment and selection at a large language model (LLM) company. Created and labeled evaluation datasets for natural language processing tasks such as Q&A, reasoning, summarization, and information extraction. Assessed mainstream language models on multiple criteria, including accuracy, hallucination rate, and robustness, to support business deployment.
• Built datasets for Q&A, reasoning, summarization, and extraction.
• Rated model outputs for accuracy and reliability.
• Contributed to the design of model evaluation and selection processes.
• Supported real-world deployment by evaluating model performance.

2023 - Present

Education

Tsinghua University

Master of Science, Computer Science and Technology

2020 - 2023

Bachelor's Institution

Bachelor of Science, Computer Science

2016 - 2020

Work History

Leading Chinese Foundation Model Company

LLM Algorithm Engineer / AI Application Engineer

Beijing
2023 - Present