For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A

Alok Kumar

LLM Fine-Tuning & AI Training (BioQwen-NEET)

India flagindore, India
$5.00/hrEntry LevelRoboflow

Key Skills

Software

RoboflowRoboflow

Top Subject Matter

Educational LLMs (NEET Biology)

Top Data Types

TextText
VideoVideo
ImageImage

Top Task Types

Fine-tuningFine-tuning

Freelancer Overview

LLM Fine-Tuning & AI Training (BioQwen-NEET). Core strengths include Hugging Face. Education includes Bachelor of Technology, Acropolis Institute of Technology and Research (2026). AI-training focus includes data types such as Text and labeling workflows including Fine-tuning.

Entry LevelEnglish

Labeling Experience

LLM Fine-Tuning & AI Training (BioQwen-NEET)

TextFine Tuning
Fine-tuned a Qwen2.5-3B LLM on a domain-specific NEET Biology corpus using advanced AI training techniques. Conducted supervised fine-tuning on 16K curated samples and implemented reinforcement learning with custom reward functions for correctness and reasoning. Developed structured tool-calling skills within the model using JSON-formatted tags to improve its reasoning evaluation capability. • Managed large-scale, domain-specific text data in accordance with strict curriculum guidelines. • Focused on model alignment and robust performance through supervised and reinforcement learning methods. • Evaluated model performance using task-specific metrics and validation sets. • Engineered custom reward functions to enhance answer quality and reasoning ability.

Fine-tuned a Qwen2.5-3B LLM on a domain-specific NEET Biology corpus using advanced AI training techniques. Conducted supervised fine-tuning on 16K curated samples and implemented reinforcement learning with custom reward functions for correctness and reasoning. Developed structured tool-calling skills within the model using JSON-formatted tags to improve its reasoning evaluation capability. • Managed large-scale, domain-specific text data in accordance with strict curriculum guidelines. • Focused on model alignment and robust performance through supervised and reinforcement learning methods. • Evaluated model performance using task-specific metrics and validation sets. • Engineered custom reward functions to enhance answer quality and reasoning ability.

2023 - Present

Education

A

Acropolis Institute of Technology and Research

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology
2022 - 2026

Work History

S

sel;f employee

ai engineer

indore
2024 - Present