LLM Response Evaluation & Multilingual Text/Image Annotation
I worked on multiple data labeling projects through Outlier.ai and Remotasks. On Outlier, I helped improve large language models by evaluating and ranking AI-generated chatbot responses for clarity, tone, accuracy, and helpfulness in both Arabic and English. I also took part in prompt-and-response evaluations that assessed language quality and usability. On Remotasks, I annotated images of e-commerce products with bounding boxes and classification labels, and I labeled text by sentiment and category, following the task instructions and quality control standards. Across all projects, I prioritized accuracy, close adherence to task guidelines, and consistency in multilingual feedback.