Tai Tran - Data Labeler - Propaganda Detection LLM Thesis Project

Key Skills

Software

Google Cloud Vertex AI

Top Subject Matter

Vietnamese political content

social media propaganda detection

LLM hallucination detection for Vietnamese language

Top Data Types

Text

Document

Top Task Types

Classification

Prompt Response Writing SFT

Freelancer Overview

Data Labeler - Propaganda Detection LLM Thesis Project. Brings 2+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, University of Information Technology (2022). AI-training focus includes data types such as Text and labeling workflows including Classification and Prompt + Response Writing (SFT).

IntermediateEnglishVietnamese

Labeling Experience

Prompt Engineer/Fine-tuner - UIT Data Science Challenge 2025

TextPrompt Response Writing SFT

I fine-tuned Qwen3-4B-Instruct-2507 for hallucination classification in Vietnamese LLM outputs. The process involved preparing high-quality instruction-based SFT prompts with three hallucination classes: no, intrinsic, extrinsic. Evaluation and quality control led to a top 7 ranking in a national data science competition. • Designed and annotated instruction-based classification prompts • Performed fine-tuning leveraging RLHF guidelines • Validated annotation quality via F1 metric (0.841) • Implemented structured evaluation using golden test cases

2025 - 2025

Data Labeler - Propaganda Detection LLM Thesis Project

TextClassification

I collected and labeled 5,998 Vietnamese political comments for propaganda detection, applying tailored annotation guidelines. Manual labeling was performed using defined categories: PHAN DONG, KHONG PHAN DONG, and KHONG LIEN QUAN. The project required normalization of teencode and slang with the aid of contextual post summaries. • Developed Vietnamese-context annotation guidelines • Achieved Cohen’s Kappa agreement of 0.73 • Benchmarked model results to validate annotation quality • Annotated entire dataset over 3 months

2025 - 2025

Education

U

University of Information Technology

Bachelor of Science, Information Systems

Bachelor of Science

2022

Work History

S

Sustainable Textile Solution

AI Engineer Intern

Ho Chi Minh City

2025 - 2026