AI Annotator / LLM Trainer
As an AI Annotator and LLM Trainer at Medivial AI, I annotated and validated datasets for Natural Language Processing (NLP) and large language model (LLM) training pipelines. I performed structured quality assurance on AI-generated outputs and contributed to reinforcement learning from human feedback (RLHF) tasks. My responsibilities included identifying and documenting edge cases, ambiguities, hallucinations, and factual inconsistencies in language model outputs. • Annotated and validated text datasets for LLM pipelines • Performed RLHF to enhance model alignment and reasoning • Provided structured feedback to improve response accuracy • Utilized annotation platforms such as Toloka and Medivial AI