Instruction Dataset Curator/Annotator
I contributed to the creation and curation of instruction-tuning datasets for LLM fine-tuning, generating high-quality prompt-response pairs with RLHF preference annotations. I integrated inter-annotator agreement metrics and automatic rejection for low-consensus batches. I ensured the datasets met rigorous quality standards used in production and academic research. • Produced 12,000+ prompt-response pairs with RLHF annotation • Implemented Cohen’s Kappa agreement for annotator quality control • Developed and utilized human-in-loop interfaces for review • Supported LLM fine-tuning pipelines with quality-checked data