Model Validation Expert (MOVE Fellow)
As a Model Validation Expert (MOVE Fellow) at Handshake AI, I evaluate large language model (LLM) responses using structured, rubric-based frameworks. I generate RLHF training data through preference ranking and response comparison against predefined evaluation criteria. My focus is health science and medical domain QA, where I check responses for subject-matter accuracy.
• Evaluate LLM response quality, factual accuracy, and reasoning coherence using structured rubrics
• Generate RLHF training data via preference ranking and comparative response evaluation
• Apply medical and scientific expertise to LLM evaluation tasks
• Use the Handshake AI platform for data annotation and assessment