NLP Dataset Annotation for Generative Language Models
Annotated and curated text datasets for large language model (LLM) fine-tuning. Performed entity recognition, classification, and summarization tasks to enhance model understanding and output quality. Developed prompt + response pairs for supervised fine-tuning (SFT) and evaluated model responses to ensure alignment with quality standards and desired tone. Managed dataset size of over 50,000+ text entries, ensuring high accuracy and consistency. The project improved generative AI capabilities in producing creative marketing copy, editorial content, and client-facing NLP outputs.