For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Alen Kurian

Alen Kurian

AI Specialist - Large Language Models

INDIA flag
kochi, India
$10.00/hrExpertLabelboxData Annotation Tech

Key Skills

Software

LabelboxLabelbox
Data Annotation TechData Annotation Tech

Top Subject Matter

No subject matter listed

Top Data Types

TextText
AudioAudio

Top Label Types

Classification
Segmentation
Question Answering

Freelancer Overview

I am an AI Specialist and Data Annotation Expert with over five years of hands-on experience in AI training data development, evaluation, and optimization. I have contributed to large-scale projects involving large language models (LLMs), natural language processing (NLP), computer vision, robotics, and generative AI systems. My expertise includes text classification, named entity recognition (NER), sentiment analysis, image and video annotation, multimodal data preparation, prompt engineering, and structured model evaluation. I am highly skilled in Python, annotation platforms such as CVAT and LabelImg, and machine learning frameworks including TensorFlow and PyTorch. My work focuses on ensuring data precision, reasoning validation, safety alignment, and gold-standard dataset creation to enhance AI model accuracy and reliability. With extensive experience collaborating remotely with global AI teams, I consistently deliver high-quality, scalable training data solutions for advanced machine learning applications.

ExpertEnglish

Labeling Experience

Labelbox

Large Language Model Evaluation Framework

LabelboxTextQuestion Answering
Developed structured evaluation metrics for grading reasoning, coherence, factuality, and safety of AI-generated responses.

Developed structured evaluation metrics for grading reasoning, coherence, factuality, and safety of AI-generated responses.

2025
Data Annotation Tech

Music Genre Classification Dataset Labeling

Data Annotation TechAudioSegmentationClassification
Annotated music clips from the GTZAN dataset by genre, creating labeled spectrograms for CNN model training. Performed audio segmentation to isolate key patterns, ensuring balanced representation across 10 genres. Quality assurance involved multiple verification rounds to eliminate mislabels.

Annotated music clips from the GTZAN dataset by genre, creating labeled spectrograms for CNN model training. Performed audio segmentation to isolate key patterns, ensuring balanced representation across 10 genres. Quality assurance involved multiple verification rounds to eliminate mislabels.

2024 - 2024
Labelbox

Credit Card Fraud Detection Dataset Annotation

LabelboxTextClassification
Annotated and validated large-scale credit card transaction datasets for supervised machine learning models, labeling entries as "Fraud" or "Non-Fraud." Ensured data balance between classes, handled missing values, and maintained strict quality control through multiple review passes. The labeled data was used to train Random Forest and Logistic Regression models, achieving high precision and recall in fraud detection.

Annotated and validated large-scale credit card transaction datasets for supervised machine learning models, labeling entries as "Fraud" or "Non-Fraud." Ensured data balance between classes, handled missing values, and maintained strict quality control through multiple review passes. The labeled data was used to train Random Forest and Logistic Regression models, achieving high precision and recall in fraud detection.

2024 - 2024

Education

U

UC College

Master of Computer Applications, Computer Applications

Master of Computer Applications
2023 - 2025
N

NSS College

Bachelor of Computer Applications, Computer Applications

Bachelor of Computer Applications
2020 - 2023

Work History

I

Innodata Inc.

Data Annotation & Evaluation Specialist

kochi
2025 - Present
T

TELUS Digital

AI Specialist

kochi
2025 - Present