For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Wei Nee Chin

Wei Nee Chin

AI Project Team Member – Data Labeling & QA

Malaysia flagJohor Bahru, Malaysia
$25.00/hrIntermediateScale AITelus

Key Skills

Software

Scale AIScale AI
TelusTelus

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Audio Recording
Evaluation Rating
Fine Tuning
Prompt Response Writing SFT
Red Teaming

Freelancer Overview

Since May 2024, I’ve been working as a freelancer on various AI training and data labeling projects, contributing across multiple domains including LLM evaluation, audio prompt transcription, image relevance tagging, and video quality comparison. My consistent performance and attention to detail have often led to promotions into reviewer roles, where I’ve been trusted to ensure labeling accuracy and guideline compliance. I take pride in being highly responsible and adaptable—quickly learning new project objectives, tools, and annotation standards. Whether working independently or as part of a team, I strive to deliver high-quality work that supports the development of reliable and ethical AI systems.

IntermediateEnglishChinese Mandarin

Labeling Experience

Scale AI

Audio Prompt Recording & Quality Evaluation

Scale AIAudioEvaluation RatingAudio Recording
Created spoken prompts by recording original audio and generating accurate transcripts to support speech-based AI training. Each submission included a self-assessment of recording quality based on clarity, tone, and background noise. The project required attention to detail, linguistic fluency, and consistent adherence to audio standards.

Created spoken prompts by recording original audio and generating accurate transcripts to support speech-based AI training. Each submission included a self-assessment of recording quality based on clarity, tone, and background noise. The project required attention to detail, linguistic fluency, and consistent adherence to audio standards.

2025 - 2025
Scale AI

Wilderness Queen – Multi-Turn Prompt Evaluation & Response Refinement

Scale AITextRLHFFine Tuning
Participated in a multi-turn LLM fine-tuning project involving up to three-turn conversations. Tasks included evaluating prompt-response pairs for quality, coherence, and helpfulness, as well as rewriting model responses to improve clarity, tone, and alignment with user intent. The project required strong judgment, linguistic sensitivity, and consistency in applying nuanced guidelines across evolving dialogue contexts.

Participated in a multi-turn LLM fine-tuning project involving up to three-turn conversations. Tasks included evaluating prompt-response pairs for quality, coherence, and helpfulness, as well as rewriting model responses to improve clarity, tone, and alignment with user intent. The project required strong judgment, linguistic sensitivity, and consistency in applying nuanced guidelines across evolving dialogue contexts.

2025 - 2025
Scale AI

Pangolin Translation SFT

Scale AITextTranslation LocalizationEvaluation Rating
Contributed to supervised fine-tuning of a multilingual language model by translating prompts and responses, reviewing machine-generated outputs, and ensuring linguistic accuracy and cultural relevance. Promoted to reviewer role for consistent quality and attention to detail.

Contributed to supervised fine-tuning of a multilingual language model by translating prompts and responses, reviewing machine-generated outputs, and ensuring linguistic accuracy and cultural relevance. Promoted to reviewer role for consistent quality and attention to detail.

2025 - 2025
Scale AI

Prompt & Rubric Creation for LLM Evaluation

Scale AITextFine TuningEvaluation Rating
Created diverse and targeted prompts to evaluate LLM performance across multiple tasks such as summarization, extraction, creative writing, etc. Designed rubrics to assess model outputs based on relevance, coherence, factual accuracy, and tone. The project required linguistic sensitivity, creativity, and alignment with specific evaluation guidelines to ensure consistent and meaningful scoring.

Created diverse and targeted prompts to evaluate LLM performance across multiple tasks such as summarization, extraction, creative writing, etc. Designed rubrics to assess model outputs based on relevance, coherence, factual accuracy, and tone. The project required linguistic sensitivity, creativity, and alignment with specific evaluation guidelines to ensure consistent and meaningful scoring.

2025 - 2025
Scale AI

Reinforcement Learning from Human Feedback (RLHF)

Scale AITextEvaluation RatingPrompt Response Writing SFT
Contributed to a RLHF project focused on improving LLM alignment with human preferences. Tasks included evaluating model responses for helpfulness, relevance, and safety, as well as ranking outputs and providing feedback to guide reinforcement learning. My sustained involvement over several months reflects my reliability, consistency, and deep understanding of nuanced evaluation guidelines.

Contributed to a RLHF project focused on improving LLM alignment with human preferences. Tasks included evaluating model responses for helpfulness, relevance, and safety, as well as ranking outputs and providing feedback to guide reinforcement learning. My sustained involvement over several months reflects my reliability, consistency, and deep understanding of nuanced evaluation guidelines.

2024 - 2025

Education

N

Northwood University

Bachelor Of Business Administration, International Business And Management

Bachelor Of Business Administration
2015 - 2018

Work History

A

Audi Australia

Product Coordinator

Zetland
2024 - 2025
C

Crystal International Group Limited

Business Associate

Hong Kong
2022 - 2023