AI Content Evaluator & Specialist (Freelance)
I specialized in ranking and evaluating large language model (LLM) outputs and in providing the human feedback used in reinforcement learning from human feedback (RLHF). My responsibilities included assessing AI-generated responses for accuracy, safety, and linguistic quality. I supported iterative AI improvement through prompt testing and data annotation.
• Evaluated English-language LLM outputs for accuracy and compliance
• Conducted prompt engineering and created feedback loops for model refinement
• Used specialized annotation tools in a remote environment
• Managed quality assurance tasks within AI content moderation workflows