LLM evaluation & Text Annotation
Contributed to training and improving large language models by evaluating AI-generated responses for accuracy, relevance, and bias. Performed detailed text classification, sentiment analysis, and ranking tasks in both English and French. Followed strict annotation guidelines to ensure consistency and quality across datasets. Provided structured feedback to refine model performance, enhance safety, and reduce bias in outputs.