AI Data Labeling & Evaluation – Math, Language and Code
Worked as a freelance AI data labeler and evaluator on multiple projects focused on improving large language models (LLMs). Tasks included evaluating and ranking AI-generated responses in mathematics, logical reasoning, programming (Python and SQL), and multilingual text (English and French). Performed data annotation, error detection, and quality assessment to ensure accuracy, clarity, and alignment with project guidelines. Contributed to model performance improvement by providing structured feedback, corrected solutions, and high-quality labeled datasets while strictly adhering to confidentiality and quality standards.