Software Engineer & AI Labeling Specialist (Web, Mobile & Data QA)
Quebec City, Canada
$15.00/hrIntermediateRemotasksSurge AIOther
Key Skills
Software
Remotasks
Surge AI
Other
Scale AI
Top Subject Matter
No subject matter listed
Top Data Types
Computer Code Programming
Image
Text
Top Task Types
Computer Programming Coding
Evaluation Rating
Prompt Response Writing SFT
Text Generation
Freelancer Overview
I am an AI training and data labeling specialist, as well as a full-stack developer, with strong experience in LLM evaluation, code annotation, and prompt quality analysis. I regularly review JavaScript, TypeScript, and Python code, debug AI-generated code, explain function behavior, implement architectural improvements, add documentation, and provide structured labels for clarity and correctness.
I apply solid software engineering practices including SOLID principles, clean code methodologies, and modular software architecture to ensure maintainable, scalable, and production-ready solutions. My technical background includes building and reviewing full-stack applications, designing API-driven systems, and contributing to robust software architectures.
I also have hands-on experience evaluating model outputs for reasoning, safety, and style across text, image, and multimodal workflows (GPT-4o, DALL·E 3, Voiceflow). My work includes JSON/function-calling evaluation, RAG evaluation, agent behavior analysis, prompt optimization, and conversational quality review.
With a strong foundation in software engineering, I confidently handle structured data pipelines, API-based workflows, mobile computer vision tasks, chatbot automation, customer support AI, and end-to-end system integration. I speak English, French, Arabic, and Chinese, enabling high-quality multilingual evaluation and international collaboration.
IntermediateArabicFrenchEnglishChinese Mandarin
Labeling Experience
LLM Evaluation & Prompt Quality Review
OtherTextText GenerationText Summarization
Evaluated GPT-4o outputs for accuracy, clarity, tone, and factual consistency across multilingual prompts (French, English, Arabic, Chinese). Performed A/B comparisons, scored model responses, detected hallucinations, and validated safety compliance. Reviewed and optimized prompts to improve response quality for customer-support chatbots and automated workflows.
Evaluated GPT-4o outputs for accuracy, clarity, tone, and factual consistency across multilingual prompts (French, English, Arabic, Chinese). Performed A/B comparisons, scored model responses, detected hallucinations, and validated safety compliance. Reviewed and optimized prompts to improve response quality for customer-support chatbots and automated workflows.
2025
Education
L
Laval University
Bachelor's Degree, Software Engineering
Bachelor's Degree
2023
Work History
P
personal project
Automation of B2B prospecting with LinkedIn & Make