Cypher Human Evals & RLHF
Compared and rated model-generated responses based on multiple evaluation criteria, including localization, instruction following, truthfulness, and style consistency. Contributed to RLHF by crafting and rewriting prompts to improve model alignment with human intent. Delivered detailed qualitative feedback to enhance overall model coherence and user experience.