Expert RLHF Trainer & AI Content Specialist
As an Expert RLHF Trainer at Invisible Technologies (Meridian), I executed high-precision Supervised Fine-Tuning and Reinforcement Learning from Human Feedback to improve Large Language Model performance for Russian. I specialized in hallucination detection, prompt engineering, and ranking/evaluation of LLM outputs. My work directly supported model alignment and safety in Russian-language AI systems. • Conducted supervised fine-tuning on AI model outputs in Russian. • Performed expert-level reinforcement learning based on human feedback (RLHF). • Developed and applied complex prompt engineering strategies for LLMs. • Evaluated and ranked model outputs for accuracy, truthfulness, and linguistic quality.