We need senior‑minded Python engineers to design and maintain the infrastructure that powers our LLM‑training workflow. You'll architect secure sandboxes, task environments, and CI/CD pipelines; write clean, test‑driven code (pytest); containerize services with Docker; and support the researchers who rely on these tools. The project delivers reusable repositories, scoring pipelines, and developer environments for evaluating agent performance.

Applicants should bring 5+ years of professional Python experience, strong Linux command‑line skills, familiarity with FastAPI or Flask back ends, and hands‑on knowledge of GitHub Actions or a similar CI system. Comfort mentoring teammates and adopting AI coding tools (e.g., Cursor, Claude Code) is a plus.

Work is fully remote for candidates in the Asia‑Low region (Afghanistan through Vietnam). Hourly rates are tiered by experience: Junior $9, Middle $12, and Senior $16 USD. Selected candidates will complete a quick HackerRank assessment and a platform coding test before recruiter interviews are scheduled.
Total Budget
$2,500
Pay per Label
$12.5/hr
Time Requirement
20+ hrs/week
Duration
1-3 months
Python‑based LLM agent tasks and evaluation artifacts
Software
Hiring Type
Required Location
Workload / Schedule
Flexible schedule; must be able to complete the coding test and interview within 5 days