AI Training & LLM Evaluation Specialist – Outlier
Worked as an AI Training Specialist on large language model evaluation projects. Responsibilities included reviewing and rating AI-generated responses based on accuracy, coherence, reasoning quality, safety compliance, and instruction-following. Provided structured feedback to improve model performance using RLHF methodologies. Compared multiple model outputs, ranked responses, and identified hallucinations or logical inconsistencies. Ensured adherence to strict quality guidelines and maintained high evaluation accuracy under defined KPIs