AI Model Evaluator & Annotator (Independent Contractor)
As an AI Model Evaluator & Annotator, I performed in-depth comparative analysis of large language models (LLMs). I completed complex RLHF (Reinforcement Learning from Human Feedback) tasks designed to optimize AI conversational flows. My focus was ensuring data integrity and training quality through granular model evaluations. • Conducted detailed side-by-side LLM comparisons. • Identified and documented differences in model accuracy and reasoning. • Applied KYC-level scrutiny to eliminate hallucinations and inconsistencies in training data. • Provided critical feedback to help refine AI model outputs.