Freelance AI Evaluator
I evaluated and helped train Large Language Models (LLMs) through Reinforcement Learning from Human Feedback (RLHF). I designed and refined complex prompts to test the boundaries of AI models and delivered detailed analytical evaluations of the generated text. I also performed rigorous quality assurance on NLP models, assessing outputs for harmlessness, truthfulness, safety, and strict adherence to instructions.