data annotation
Project Title: Instructional Dialogue Annotation for Assistant-Style LLM Objective: To create a high-quality, diverse dataset of instruction-response pairs to fine-tune a large language model (LLM) for helpful, harmless, and accurate conversational AI. Scope of Work: Data Categorization: Annotate raw text prompts into categories (e.g., Creative Writing, Technical Q&A, Reasoning, Customer Service, Ethical Dilemma). Rewriting & Enhancement: Transform simple queries into well-structured, clear instructions. Example: Input: "tell me about python." Annotated Instruction: "Explain the Python programming language to a beginner, covering its main uses and key features in 3-4 concise paragraphs." Safety & Quality Labeling: Flag prompts containing: Harmful, unethical, or biased content. Requests for illegal activities. Factually incorrect premises (to be noted for model training). Style Specification: Tag the desired tone for the response (e.g., Formal, Friendly, Concise, Persuasive,