AI Training and Data Labeling Specialist (Freelance)
Led advanced Reinforcement Learning from Human Feedback (RLHF) and data labeling tasks to optimize Large Language Model (LLM) reasoning and safety. Developed rigorous evaluation protocols for financial and fraud detection scenarios utilizing domain expertise. Contributed to prompt engineering, red teaming, annotation, and continuous quality assurance for AI model improvements. • Fine-tuned and evaluated AI LLMs on domain-specific tasks. • Engineered multi-turn prompts and performed red teaming to identify vulnerabilities. • Conducted factual, bias, and format quality assessments for AI outputs. • Streamlined AI training workflows promoting high-quality data and strict adherence to guidelines.