RLHF Specialist
I evaluated and ranked LLM responses for appropriateness, safety, and technical correctness. This included identifying and correcting technical hallucinations in responses, especially in engineering and mathematical domains. My focus was on providing valuable human feedback for model improvement. • LLM prompt evaluation and ranking • Safety and hallucination detection • Engineering/mathematics answer review • RLHF process documentation