This project focuses on generating Reinforcement Learning from Human Feedback (RLHF) data to enhance code generation capabilities for Python development. The goal is to collect developer feedback to improve the accuracy and quality of Python code produced by large language models (LLMs). Participants will review, correct, and optimize auto-generated Python scripts, functions, and algorithms. The collected data will be used to train LLMs to better follow coding standards, efficient practices, and problem-solving strategies in Python, ultimately leading to higher-quality, more reliable code suggestions for real-world applications.
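As a purely hypothetical illustration of this kind of review task (not an actual prompt from this project), a participant might receive an LLM-generated solution to a LeetCode-style "two sum" problem and submit a corrected, optimized version:

```python
# Hypothetical example of a review task: an auto-generated solution to a
# LeetCode-style "two sum" prompt, followed by the kind of optimization a
# participant might submit as feedback.

from typing import List


def two_sum_generated(nums: List[int], target: int) -> List[int]:
    # Auto-generated draft: correct, but O(n^2) due to nested loops.
    for i in range(len(nums)):
        for j in range(len(nums)):
            if i != j and nums[i] + nums[j] == target:
                return [i, j]
    return []


def two_sum_reviewed(nums: List[int], target: int) -> List[int]:
    # Reviewer's optimized version: single pass with a hash map, O(n).
    seen = {}  # maps each visited value to its index
    for i, value in enumerate(nums):
        complement = target - value
        if complement in seen:
            return [seen[complement], i]
        seen[value] = i
    return []


print(two_sum_reviewed([2, 7, 11, 15], 9))  # → [0, 1]
```

A prompt/answer pair like this, together with the reviewer's correction, is the kind of data point that can feed an RLHF training pipeline.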
Total Budget
$42,500
Pay Rate
$50/hr
Time Requirement
Flexible
Duration
3-6 months
Data Type
Leetcode-esque prompts and answers
Subject Matter / Industry
Software
Workload / Schedule
Work 15 hours weekly for 4-6 months; flexible, with no fixed schedule.