Text Labeling and Language Model Evaluation for Multilingual NLP Projects
In this project, I was responsible for labeling and evaluating text for training and fine-tuning large language models (LLMs) in Indonesian and English. Key tasks included sentence classification based on tone and intent, named entity annotation (NER), evaluating model responses to prompts, and writing and refining prompt-responses in the context of questions and answers and text summaries. I was also involved in content translation and localization, as well as stylistic adjustments for varying levels of formality. The project involved thousands of data units, with high-quality standards such as Cohen's Kappa for consistency and multiple review.