Anish Kumar

Expert AI Data Labeler | NLP, CV, LLM Evaluation & Multimodal Training

Buhana, India
$35.00/hr | Intermediate | Remotasks | Scale AI | Toloka

Key Skills

Software

Remotasks
Scale AI
Toloka
Labelbox

Top Subject Matter

No subject matter listed

Top Data Types

Document
Image
Text

Top Task Types

Bounding Box
Classification
Computer Programming Coding
Prompt Response Writing SFT
RLHF

Freelancer Overview

I am currently pursuing a B.Tech in Computer Science Engineering with a specialization in Artificial Intelligence and Machine Learning. Alongside my academic background, I have hands-on experience working on AI training and data labeling projects through platforms like Outlier and Toloka. My tasks have included LLM evaluation, response ranking, text classification, image labeling, and reinforcement learning from human feedback (RLHF). This combination of academic learning and real-world experience has allowed me to develop a strong understanding of how high-quality data powers AI systems. I am skilled in accurately labeling diverse data types (text, image, and multi-modal), evaluating model outputs, and providing critical feedback to improve AI performance. My attention to detail, consistency, and ability to follow complex guidelines help me contribute effectively to the development of cutting-edge AI systems. As someone studying AI and actively working in the field, I bring both technical knowledge and practical experience that set me apart.

Intermediate | Hindi | English

Labeling Experience

Scale AI

AI Training Data Creator — Hint Writing & Model Guidance

Scale AI | Text | RLHF | Evaluation Rating
In this project, I write detailed, diverse, and educational hints that help train AI models to solve complex reasoning and math problems. Each task requires a careful reading of the prompt, step-by-step guidance, and iterative testing to confirm that the model reaches the correct answer without the hint stating it outright.

My responsibilities include:
Analyzing mathematical and logical prompts, often involving functions, roots, and symmetry.
Writing clear, non-trivial hints that progressively lead the AI to the known final answer.
Ensuring hints cover multiple reasoning aspects and diverse explanatory approaches.
Reviewing the model’s responses to my hints and refining the hints where needed.
Checking whether the ground-truth answers are correct and flagging discrepancies.

This project focuses on enhancing AI reasoning and problem-solving capabilities by producing high-quality training data for supervised fine-tuning and reinforcement learning from

2024
Labelbox

Computer Code/Programming

Labelbox | Computer Code Programming | Computer Programming Coding
Computer code and programming tasks: reviewing model-generated code and checking it for errors.

2022
Scale AI

AI Training Data Labeler for NLP and Sequence Modeling Tasks

Scale AI | Text | RLHF | Evaluation Rating
Worked on multiple AI training projects involving generation of high-quality hints, evaluation of model outputs, and reinforcement learning from human feedback (RLHF) tasks. Responsible for creating detailed, step-by-step textual hints to guide AI models to the correct answers, reviewing and rating model responses for accuracy, and refining training datasets for improved AI understanding. Ensured adherence to strict quality guidelines to maximize data reliability and model performance.

2024

Education

SGHSS Public School

12th Class
2021 - 2022
R.S.V Secondary Public School

10th Class
2019 - 2020

Work History

Anish K. hasn’t added any Work History to their OpenTrain profile yet.