LLM Evaluation, RLHF, and NLP Data Annotation Project
Supported training and fine-tuning of transformer-based language models through structured evaluation and reinforcement learning from human feedback (RLHF) workflows. Annotated and evaluated 200,000+ text samples, including prompts, model outputs, and conversational exchanges.
Responsibilities included:
• Ranking model responses for coherence and relevance.
• Identifying hallucinations and factual inaccuracies.
• Evaluating model outputs for bias and safety.
• Annotating text for named entity recognition (NER).
• Writing high-quality supervised fine-tuning data.
• Red-teaming model outputs for vulnerability testing.