Language Data Annotator / LLM Trainer
I contributed to the training pipeline of a Korean large language model by creating, refining, and evaluating high-quality textual data tailored for Korean NLP tasks. My responsibilities included prompt and response generation for instruction tuning, curating culturally relevant datasets, and annotating linguistic nuances such as honorifics, formality levels, and idiomatic expressions unique to Korean. My native fluency in Korean and background in linguistics enabled me to ensure both linguistic accuracy and cultural alignment, enhancing the model’s performance in real-world Korean-language applications.