Generalist LLM Evaluation – Senior Reviewer
As part of a large-scale project aimed at training and fine-tuning general-purpose language models, I was promoted to Senior Reviewer based on the consistent quality of my evaluations. My responsibilities included:
- Reviewing AI-generated responses across a wide range of topics (general knowledge, logical reasoning, creative writing, etc.).
- Assessing accuracy, coherence, tone, and helpfulness in alignment with evolving guidelines.
- Comparing model outputs (side-by-side evaluation), ranking completions, and identifying failure modes.
- Reviewing the work of other annotators and providing quality control and mentoring support.
- Liaising with project admins to ensure guideline updates were well implemented.