Japanese LLM Prompt–Response Evaluation and Editing
I evaluated Japanese prompt–response pairs for LLM training. I created fine-grained evaluation criteria and scored responses based on structure, tone, relevance, and factual accuracy. I also edited generated responses for clarity and naturalness, and wrote prompts and sample completions (SFT-style) to guide model behavior. My feedback contributed to model fine-tuning and annotation guideline improvement.