Multilingual AI Prompt Evaluation & Transcription Labeling
This project focused on improving large language model output across multiple languages (Arabic, English, French). Tasks included ranking AI-generated responses, rewriting for clarity or correctness, evaluating factual accuracy, and reviewing transcripts with detailed error annotation (including filler words, hesitations, false starts). I followed strict guidelines to ensure consistency, relevance, and linguistic accuracy. The project required strong language judgment, adherence to nuanced instructions, and rapid adaptation to evolving quality standards.