Tai Tran

Data Labeler - Propaganda Detection LLM Thesis Project

Ho Chi Minh City, Vietnam
$5.00/hr · Intermediate · Google Cloud Vertex AI

Key Skills

Software

Google Cloud Vertex AI

Top Subject Matter

Vietnamese political content
social media propaganda detection
LLM hallucination detection for Vietnamese language

Top Data Types

Text
Document

Top Task Types

Classification
Prompt Response Writing SFT

Freelancer Overview

Data Labeler - Propaganda Detection LLM Thesis Project. Brings 2+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, University of Information Technology (2022). AI-training focus includes data types such as Text and labeling workflows including Classification and Prompt + Response Writing (SFT).

Intermediate · English · Vietnamese

Labeling Experience

Prompt Engineer/Fine-tuner - UIT Data Science Challenge 2025

Text · Prompt Response Writing SFT

I fine-tuned Qwen3-4B-Instruct-2507 for hallucination classification in Vietnamese LLM outputs. The process involved preparing high-quality instruction-based SFT prompts with three hallucination classes: no, intrinsic, extrinsic. Evaluation and quality control led to a top 7 ranking in a national data science competition.

• Designed and annotated instruction-based classification prompts
• Performed fine-tuning leveraging RLHF guidelines
• Validated annotation quality via F1 metric (0.841)
• Implemented structured evaluation using golden test cases
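To illustrate the kind of F1 check mentioned above, here is a minimal sketch in plain Python. The profile does not say whether the reported 0.841 is macro- or micro-averaged, so macro-averaging is an assumption here, and the function name `macro_f1` and the sample labels are hypothetical, not taken from the competition code.

```python
# Class names follow the three hallucination classes named in the profile.
LABELS = ("no", "intrinsic", "extrinsic")


def macro_f1(gold, pred, labels=LABELS):
    """Return the unweighted mean of per-class F1 scores (assumed averaging mode)."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    scores = []
    for label in labels:
        pairs = list(zip(gold, pred))
        tp = sum(g == label and p == label for g, p in pairs)
        fp = sum(g != label and p == label for g, p in pairs)
        fn = sum(g == label and p != label for g, p in pairs)
        denom = 2 * tp + fp + fn
        # A class with no gold items and no predictions contributes 0.0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical golden test cases versus model predictions.
gold = ["no", "intrinsic", "extrinsic", "no"]
pred = ["no", "intrinsic", "no", "no"]
print(round(macro_f1(gold, pred), 3))
```

Macro-averaging weights the three classes equally, which matters when one class (for example "no") dominates the data.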


2025

Data Labeler - Propaganda Detection LLM Thesis Project

Text · Classification

I collected and labeled 5,998 Vietnamese political comments for propaganda detection, applying tailored annotation guidelines. Manual labeling used three defined categories: PHAN DONG (reactionary), KHONG PHAN DONG (not reactionary), and KHONG LIEN QUAN (unrelated). The project required normalizing teencode and slang, with contextual post summaries as an aid.

• Developed Vietnamese-context annotation guidelines
• Achieved Cohen’s Kappa agreement of 0.73
• Benchmarked model results to validate annotation quality
• Annotated entire dataset over 3 months
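For readers unfamiliar with the agreement figure above, here is a minimal sketch of how Cohen's Kappa is computed for two annotators. The function name `cohens_kappa` and the sample annotations are hypothetical; the profile does not describe the actual tooling used.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's Kappa: agreement between two raters, corrected for chance."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(rater_a)
    # Observed agreement: share of items where both raters chose the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement: chance overlap given each rater's label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    labels = set(count_a) | set(count_b)
    expected = sum(count_a[label] * count_b[label] for label in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical annotations using the three categories from the project.
a = ["PHAN DONG", "KHONG PHAN DONG", "KHONG LIEN QUAN", "PHAN DONG"]
b = ["PHAN DONG", "KHONG PHAN DONG", "PHAN DONG", "PHAN DONG"]
print(round(cohens_kappa(a, b), 3))
```

On the commonly cited Landis and Koch scale, values between 0.61 and 0.80 are read as substantial agreement, which is where 0.73 falls.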


2025

Education

University of Information Technology

Bachelor of Science, Information Systems

2022

Work History

Sustainable Textile Solution

AI Engineer Intern

Ho Chi Minh City
2025 - 2026