LLM Evaluation and RLHF Data Labeling (TR/EN)
Contributed to large-scale LLM training by labeling and evaluating AI-generated text in both Turkish and English. Tasks included ranking model outputs, writing prompts and responses for supervised fine-tuning, evaluating text coherence, fluency, and relevance, as well as providing human feedback for reinforcement learning.