AI Data Annotator & Content Specialist, Outlier AI
I performed expert-level data annotation and labeling for large language model (LLM) training datasets spanning health, legal, and technical domains. My work included fact-checking, source verification, web research, and evaluation of intent and context in user queries. I utilized Reinforcement Learning from Human Feedback (RLHF) methods, preference labeling, response ranking, and contributed to prompt engineering and red-teaming activities for AI models. • Evaluated LLM outputs for accuracy, helpfulness, safety, and tone • Identified hallucinations, bias, and inconsistencies in AI responses • Conducted thorough validation and ensured high-quality labeling standards • Improved model robustness and alignment through continuous feedback