For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Chycik Ayu Winata

Chycik Ayu Winata

LLM Evaluation Expert (Indonesian) with STEM & Diverse Project Exposure

INDONESIA flag
Jakarta, Indonesia
$20.00/hrIntermediateData Annotation TechTelus

Key Skills

Software

Data Annotation TechData Annotation Tech
TelusTelus

Top Subject Matter

No subject matter listed

Top Data Types

DocumentDocument
ImageImage
TextText

Top Task Types

Evaluation Rating
Prompt Response Writing SFT
Text Generation
Text Summarization
Translation Localization

Freelancer Overview

An experienced AI training contributor with a strong focus on LLM evaluation and personalized content analysis. I have worked on bilingual (English and Indonesian) projects involving prompt assessment, output review, and cultural localization. My attention to detail and linguistic fluency allow me to evaluate AI-generated content for accuracy, tone, and user alignment. Reviewed and refined AI-generated outputs by assessing relevance, factual accuracy, tone, and user intent. In addition to evaluating conversational AI responses, I have assessed the relevance and quality of personalized ads, ensuring they meet platform standards and user expectations. My background in both technical review and content analysis equips me to contribute to projects that require critical thinking, cultural sensitivity, and languageĀ precision.

IntermediateIndonesianEnglish

Labeling Experience

Data Annotation Tech

Bilingual LLM Evaluation Rater

Data Annotation TechTextClassificationQuestion Answering
In multiple AI evaluation projects, I assessed chatbot and AI-generated responses across five key dimensions: correctness/truthfulness, verbosity, instruction following, localization, and harmfulness. Tasks included evaluating single-turn and multi-turn interactions, crafting prompts to trigger specific errors or behaviors, and ensuring cultural and linguistic appropriateness in Indonesian while adhering to strict quality guidelines.

In multiple AI evaluation projects, I assessed chatbot and AI-generated responses across five key dimensions: correctness/truthfulness, verbosity, instruction following, localization, and harmfulness. Tasks included evaluating single-turn and multi-turn interactions, crafting prompts to trigger specific errors or behaviors, and ensuring cultural and linguistic appropriateness in Indonesian while adhering to strict quality guidelines.

2024
Data Annotation Tech

Talk to a Chatbot and Compare Responses, Focus on FACTUALITY Issues (Multi-Turn & Single-Turn))

Data Annotation TechTextClassificationQuestion Answering
The project focused on evaluating AI chatbot responses in both single-turn and multi-turn settings, with emphasis on detecting and categorizing factuality issues. In addition to assessment, tasks included crafting targeted prompts designed to elicit specific types of factual errors, as defined by the project guidelines. Evaluations covered accuracy, coherence, and alignment with user intent in both English and Indonesian. Strict quality measures were followed, including adherence to scoring rubrics and consistency checks across all evaluation cycles.

The project focused on evaluating AI chatbot responses in both single-turn and multi-turn settings, with emphasis on detecting and categorizing factuality issues. In addition to assessment, tasks included crafting targeted prompts designed to elicit specific types of factual errors, as defined by the project guidelines. Evaluations covered accuracy, coherence, and alignment with user intent in both English and Indonesian. Strict quality measures were followed, including adherence to scoring rubrics and consistency checks across all evaluation cycles.

2025 - 2025

Education

U

University of National Development "Veteran" East Java

Bachelor's in Informatics, Computer Science

Bachelor's in Informatics
2020 - 2024

Work History

E

Era Supplies Indonesia

Product Specialist

Jakarta
2025 - Present