Sai Kolla

Project Helix Contributor (AI Model Evaluation & Dataset Construction)

Phoenix, USA
$20.00/hr
Entry Level
Other

Key Skills

Software

Other

Top Subject Matter

Software Engineering
AI Evaluation
AI Dataset Sourcing

Top Data Types

Text
Image
Video

Top Task Types

Data Collection
Fine Tuning

Freelancer Overview

Project Helix Contributor (AI Model Evaluation & Dataset Construction) with 2+ years of professional experience in research and quality-focused execution. Education: Bachelor of Science, Arizona State University (2025). AI-training work covers data types such as computer code, programming, and text, and labeling workflows including evaluation, rating, and data collection.

Entry Level
English

Labeling Experience

Instruction-Tuned Llama-3.2-3B LLM (Data Labeling & Fine-tuning)

Other
Text
Fine Tuning
For the Instruction-Tuned Llama-3.2-3B LLM project, I engineered and labeled a dataset for supervised fine-tuning. The process entailed curating instruction-response pairs and employing prompt engineering techniques to improve model performance. My work played a key role in preparing high-quality data for LLM fine-tuning and evaluation.
• Constructed and labeled approximately 12,000 instruction-response text samples.
• Applied chain-of-thought prompt strategies to enhance dataset effectiveness.
• Tuned model inference parameters based on labeled training outcomes.
• Conducted quality assurance by reviewing and annotating dataset entries.

Project Helix Sourcing (Dataset Annotation & Curation)

Other
Data Collection
As a member of Project Helix Sourcing, I sourced and annotated open-source pull requests for AI evaluation datasets. My responsibilities included extracting repository context, such as expected behaviors and edge cases, for use in large language model evaluation. My work helped curate and prepare datasets for AI-assisted software development research.
• Analyzed and extracted repository context for LLM evaluation purposes.
• Labeled repository data to create structured tasks for model benchmarking.
• Collaborated on dataset curation for AI models in the software engineering domain.
• Identified task constraints and edge cases to ensure dataset robustness.

Project Helix Contributor (AI Model Evaluation & Dataset Construction)

Other
As a contributor to Project Helix at Handshake AI, I performed technical review and judgment on AI-generated code and repository pull requests. My work involved evaluating open-source software engineering tasks and benchmarking large language model capabilities in code reasoning. I helped construct datasets that improve AI-assisted development systems for software engineering tasks.
• Evaluated and annotated open-source pull requests for suitability in AI evaluation datasets.
• Identified bugs, failures, and edge cases in code repositories to inform AI evaluation efforts.
• Provided detailed technical reasoning for corrections used in LLM benchmarking.
• Structured and labeled data to enable AI models to debug, reason about, and fix code.

Education

Arizona State University

Bachelor of Science, Computer Science
2022 - 2025

Work History

First Citizens Bank

Software Engineering Intern

Phoenix
2023 - Present

Alameda County ITD

Software Engineering Intern

Oakland
2021 - 2021