For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Dat Nguyen

Dat Nguyen

Expert in AI Code Generation & RLHF for Python, C++ in EN, VN, JP languages

Vietnam flagHo Chi Minh City, Vietnam
$20.00/hrIntermediateCVATGoogle Cloud Vertex AILabelimg

Key Skills

Software

CVATCVAT
Google Cloud Vertex AIGoogle Cloud Vertex AI
LabelImgLabelImg
OpenCV AI Kit (OAK)OpenCV AI Kit (OAK)
Internal/Proprietary Tooling

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
VideoVideo

Top Task Types

Computer Programming Coding
Evaluation Rating
RLHF
Text Summarization
Translation Localization

Freelancer Overview

I am a Computer Science graduate (Excellent Honors from HCMUT) and AI Engineer specializing in RLHF for Code Generation and NLP. I have over 2 years of experience in curating and annotated datasets for Machine Learning models, specifically in Python and C++. My expertise includes evaluating LLM code outputs, building datasets for recommendation systems, and conducting stylistic text analysis. Additionally, I hold a Japanese N3 certification, allowing me to perform high-quality multilingual data evaluation. I am detail-oriented and seeking complex technical tasks involving code review, algorithm debugging, and advanced linguistic analysis.

IntermediateEnglishJapaneseVietnamese

Labeling Experience

Novel Recommendation System

Internal Proprietary ToolingComputer Code ProgrammingRelationshipComputer Programming Coding
I developed a Novel Recommendation System using hybrid model (collaborative filtering and content-based filtering), where I was responsible for curating and cleaning large-scale user interaction datasets using Python libraries like Pandas and NumPy. My role involved evaluating model outputs, debugging code logic for algorithm optimization, and conducting RLHF-style ranking of recommendation results to enhance system accuracy. Additionally, I implemented backend logic in Python and optimized C++ data structures to ensure high-performance system integration for LLM training contexts.

I developed a Novel Recommendation System using hybrid model (collaborative filtering and content-based filtering), where I was responsible for curating and cleaning large-scale user interaction datasets using Python libraries like Pandas and NumPy. My role involved evaluating model outputs, debugging code logic for algorithm optimization, and conducting RLHF-style ranking of recommendation results to enhance system accuracy. Additionally, I implemented backend logic in Python and optimized C++ data structures to ensure high-performance system integration for LLM training contexts.

2025 - 2025

Studying the literary styles of famous 18th-century writers through ML and NLP

Internal Proprietary ToolingTextClassificationEvaluation Rating
In this research project, I conducted stylometric analysis on historical texts ranging from the 18th century to the present. I utilized NLP libraries (NLTK, Spacy) to perform complex text classification and linguistic feature extraction, ensuring high-quality data processing for authorship attribution models. My work included annotating semantic structures, analyzing writing styles, and handling multilingual data to create a robust, ground-truth labeled dataset for advanced Natural Language Processing tasks.

In this research project, I conducted stylometric analysis on historical texts ranging from the 18th century to the present. I utilized NLP libraries (NLTK, Spacy) to perform complex text classification and linguistic feature extraction, ensuring high-quality data processing for authorship attribution models. My work included annotating semantic structures, analyzing writing styles, and handling multilingual data to create a robust, ground-truth labeled dataset for advanced Natural Language Processing tasks.

2024 - 2024
OpenCV AI Kit (OAK)

Computer vision data engineer

Opencv AI Kit OakVideoBounding BoxEmotion Recognition
I implemented RNN models for dynamic object detection in video streams, focusing on precise frame-by-frame annotation for moving subjects such as animals and humans. I managed complex video datasets and performed rigorous quality assurance on labeled data to verify ground truth for classification tasks. This project required expertise in CVAT and OpenCV to ensure the accuracy of bounding boxes and temporal annotations needed for effective model convergence.

I implemented RNN models for dynamic object detection in video streams, focusing on precise frame-by-frame annotation for moving subjects such as animals and humans. I managed complex video datasets and performed rigorous quality assurance on labeled data to verify ground truth for classification tasks. This project required expertise in CVAT and OpenCV to ensure the accuracy of bounding boxes and temporal annotations needed for effective model convergence.

2023 - 2023

Education

H

Ho Chi Minh University of Technology

Bachelor of Science, Computer Science

Bachelor of Science
2021 - 2025

Work History

F

FPT Software

Business Analyst

Ho Chi Minh City
2025 - Present
S

Sorimachi Company Co.,Ltd.

Software Engineer

Ho Chi Minh City
2024 - 2024