For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C
Calvin Hizkia

Calvin Hizkia

Bilingual AI Data Annotator for LLM Fine-Tuning (EN/ID)

Indonesia flagJakarta, Indonesia
$5.00/hrEntry LevelAppenLabelboxRemotasks

Key Skills

Software

AppenAppen
LabelboxLabelbox
RemotasksRemotasks
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Fine-tuningFine-tuning
RLHFRLHF
Translation/LocalizationTranslation/Localization

Freelancer Overview

I specialize in refining large language models through data annotation and RLHF evaluation, having worked on projects for leading AI labs like Scale AI. My core focus is on improving model response quality, naturalness, and multilingual accuracy. My effectiveness stems from a strong blend of technical annotation skills and bilingual expertise (English-Indonesian), which allows me to ensure culturally nuanced and precise data labeling for AI training. This is further supported by a sharp analytical mindset honed in finance.

Entry LevelIndonesianEnglish

Labeling Experience

Labelbox

Voice Activity Detection Annotator

LabelboxAudioAudio Recording
This project involved annotating audio data to train and evaluate AI speech recognition models. The core task was to meticulously transcribe and timestamp all speech and significant non-speech events in human-AI dialogues. Key Responsibilities: Precisely labeled speaker turns (User and AI Assistant) with millisecond accuracy. Classified user tokens into Standard Speech, Interruptions (barge-ins), and Acknowledgements (backchanneling like "mm-hmm"). Annotated non-speech events such as <pause>, <stop>, and <noise> (e.g., coughs, laughter) according to strict linguistic guidelines. Ensured continuous, gap-free annotation across the audio timeline to create seamless data for model training. Skipped tasks where the AI responded in a non-English language to maintain data integrity for the target models. The resulting annotated data was critical for improving how AI assistants handle natural conversational nuances like filled pauses, overlaps, and turn-taking.

This project involved annotating audio data to train and evaluate AI speech recognition models. The core task was to meticulously transcribe and timestamp all speech and significant non-speech events in human-AI dialogues. Key Responsibilities: Precisely labeled speaker turns (User and AI Assistant) with millisecond accuracy. Classified user tokens into Standard Speech, Interruptions (barge-ins), and Acknowledgements (backchanneling like "mm-hmm"). Annotated non-speech events such as <pause>, <stop>, and <noise> (e.g., coughs, laughter) according to strict linguistic guidelines. Ensured continuous, gap-free annotation across the audio timeline to create seamless data for model training. Skipped tasks where the AI responded in a non-English language to maintain data integrity for the target models. The resulting annotated data was critical for improving how AI assistants handle natural conversational nuances like filled pauses, overlaps, and turn-taking.

2025
Scale AI

Linguistic Evaluator

Scale AITextFine Tuning
As a Linguistic Evaluator for Meta's Multilingual Static Comparison Project V2, I specialized in enhancing AI model performance for the Indonesian language. My core responsibilities involved conducting comparative analysis of AI responses to identify the most accurate and contextually appropriate reply. I meticulously tagged errors in language, conversation flow, and cultural alignment, and ranked responses based on strict quality guidelines. For substandard outputs, I authored high-quality rewrites that corrected errors, improved fluency, and ensured cultural and contextual relevance, directly contributing to the model's training and refinement.

As a Linguistic Evaluator for Meta's Multilingual Static Comparison Project V2, I specialized in enhancing AI model performance for the Indonesian language. My core responsibilities involved conducting comparative analysis of AI responses to identify the most accurate and contextually appropriate reply. I meticulously tagged errors in language, conversation flow, and cultural alignment, and ranked responses based on strict quality guidelines. For substandard outputs, I authored high-quality rewrites that corrected errors, improved fluency, and ensured cultural and contextual relevance, directly contributing to the model's training and refinement.

2025 - 2025

Education

K

Kaima Adventist High School

Diploma, High School

Diploma
2010 - 2010

Work History

P

PT Fazzmart Teknologi Nusantara

Sales Representative

Manado
2021 - 2023
P

PT Sarana Andiriksa Express

Finance Staff

Manado
2017 - 2019