For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S

Sunil Kumar

AI/ML Research Intern - Data Preparation for Model Training

India flagRemote, India
$50.00/hrIntermediateLabelbox

Key Skills

Software

LabelboxLabelbox

Top Subject Matter

Cheminformatics Domain Expertise
Computational Chemistry
Force Field Parameterization

Top Data Types

TextText

Top Task Types

Data Collection
Fine Tuning

Freelancer Overview

AI/ML Research Intern - Data Preparation for Model Training. Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Postgraduate Course, IIIT Bangalore (2024) and Master of Science, Indian Institute of Technology Madras (2022). AI-training focus includes data types such as Text, Computer Code, and Programming and labeling workflows including Data Collection and Fine-tuning.

IntermediateEnglish

Labeling Experience

Independent Research - ML-Assisted Parameterization and LLM-AI Workflow

Fine Tuning
During independent research in force field development, I utilized ML-assisted parameterization involving Gaussian Process Regression (GPR) to optimize parameters against quantum mechanical reference data. My responsibilities included integrating labeled data and utilizing LLM-assisted workflows to improve accuracy and efficiency. This work directly contributed to the creation and refinement of AI models in computational chemistry settings. • Used GPR for parameter labeling and optimization. • Integrated LLM-based AI workflows for code and parameter selection. • Generated labeled QM datasets using ORCA and Gaussian. • Prepared labeled training data for AI-driven force field models.

During independent research in force field development, I utilized ML-assisted parameterization involving Gaussian Process Regression (GPR) to optimize parameters against quantum mechanical reference data. My responsibilities included integrating labeled data and utilizing LLM-assisted workflows to improve accuracy and efficiency. This work directly contributed to the creation and refinement of AI models in computational chemistry settings. • Used GPR for parameter labeling and optimization. • Integrated LLM-based AI workflows for code and parameter selection. • Generated labeled QM datasets using ORCA and Gaussian. • Prepared labeled training data for AI-driven force field models.

2024 - Present

AI/ML Research Intern - Data Preparation for Model Training

TextData Collection
As an AI/ML Research Intern at BioCogniz, I generated quantum mechanical (QM)-based datasets for machine learning model training and validation. The role involved curating and preparing computational chemistry data for AI model consumption. Tasks included data preparation, validation, and documentation to ensure high-quality datasets for downstream model development. • Generated QM-based datasets for ML training. • Curated and validated data for consistency and reliability. • Collaborated on predictive model development using labeled datasets. • Maintained data pipelines and documentation.

As an AI/ML Research Intern at BioCogniz, I generated quantum mechanical (QM)-based datasets for machine learning model training and validation. The role involved curating and preparing computational chemistry data for AI model consumption. Tasks included data preparation, validation, and documentation to ensure high-quality datasets for downstream model development. • Generated QM-based datasets for ML training. • Curated and validated data for consistency and reliability. • Collaborated on predictive model development using labeled datasets. • Maintained data pipelines and documentation.

2024 - 2025

Education

I

Indian Institute of Technology Madras

Master of Science, Chemistry

Master of Science
2020 - 2022
G

Government National College, Sirsa

Bachelor of Science, Chemistry, Mathematics, Physics

Bachelor of Science
2017 - 2020

Work History

B

BioCogniz

Research Intern (AI/ML for Cheminformatics)

Remote
2024 - 2025
N

Narayana Institute

Chemistry Lecturer

Chennai
2022 - 2024