AI Training Data Contributor (RLHF & SFT)
I contributed to data labeling projects centered on reinforcement learning from human feedback, fine-tuning large language models for accuracy, safety, and logical consistency. My responsibilities included evaluating and ranking model responses, identifying AI-generated hallucinations, and providing feedback to enhance model tone and helpfulness. I conducted detailed assessments of language quality and cultural relevance for West African contexts. • Evaluated and rated model outputs for truthfulness and reasoning. • Identified and annotated instances of hallucinations in LLMs. • Suggested improvements to tone and user helpfulness. • Provided context-aware feedback for linguistic accuracy.