AI Training and Data Specialist
Led reinforcement learning from human feedback (RLHF) and model fine-tuning tasks for AI model training. Optimized prompts for large language models (LLMs) across training, evaluation, and feedback cycles. Implemented high-precision data tagging for supervised learning and annotation workflows.
• Delivered expert-level accuracy in text-based data annotation.
• Specialized in prompt engineering to improve LLM performance.
• Conducted iterative labeling and review sessions.
• Applied standardized methods and internal/proprietary tools for RLHF data preparation.