Samuel Girma Megra

Python Developer - Data Science & AI

Addis Ababa, Ethiopia
$30.00/hr, Expert, Label Studio, Internal/Proprietary Tooling

Key Skills

Software

Label Studio
Internal/Proprietary Tooling

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code Programming
Image
Text
Video

Top Label Types

Action Recognition
Classification
Computer Programming Coding
Data Collection
Entity NER Classification
Fine Tuning
Prompt Response Writing SFT
Question Answering
RLHF

Freelancer Overview

I am a software engineer with hands-on experience in creating and curating high-quality datasets for AI model training, particularly in NLP and supervised fine-tuning projects. My work includes generating, annotating, and refining datasets using Python and Pandas, as well as evaluating and improving model reasoning through techniques like RLHF and Chain-of-Thought rationale correction. I have contributed to projects such as hate speech detection for Afan Oromo, where I handled data preprocessing, feature engineering, and model tuning. I am skilled in building data pipelines, analyzing model behaviors, and optimizing AI workflows across various environments. My technical toolkit includes Python, SQL, Docker, and tools for both backend and data-centric development, enabling me to deliver robust solutions for AI-driven applications.

English (Expert)

Labeling Experience

Label Studio

Python AI trainer

Label Studio, Text, Entity NER Classification, Question Answering
Scope of the Project: Advancing the Gemini model’s reasoning, agentic tool-use, and multi-modal capabilities for complex data processing and workflow automation.

Specific Data Labeling Tasks Performed: Engineered SFT datasets (Python/Pandas) for data-to-plot generation; conducted RLHF to correct Chain-of-Thought reasoning; designed multi-step execution paths for simulated workflows (Slack/Jira); and labeled multi-modal data (video/image annotation).

Project Size: Managed high-volume, cross-domain datasets spanning code generation, enterprise automation, and visual reasoning.

Quality Measures Adhered To: Enforced strict RLHF protocols to eliminate hallucinations and utilized "gold-standard" execution walkthroughs to validate high-fidelity tool-call accuracy.

2023

Education

Addis Ababa University

Bachelor of Science, Software Engineering

2017 - 2022

Work History

Turing

Python AI trainer

Palo Alto
2022 - Present

Addis Ababa University

Lead Backend Developer

Debrezeit
2024 - 2025