Senior Content Evaluator & Quality Lead, NordicAI Solutions
Led the evaluation of large-scale Norwegian and English text datasets to improve the safety, helpfulness, and technical accuracy of AI-generated content. Performed high-precision pairwise comparisons and rankings of model responses against strict guidelines. Audited junior annotators' work to achieve a 99% accuracy rate in 'Gold Standard' datasets.
• Refined annotation guidelines for greater clarity
• Collaborated with machine learning engineers to reduce labeling ambiguity
• Maintained continuous quality assurance throughout the labeling process
• Specialized in truthfulness and harmlessness assessments