For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Shahad Pk

Shahad Pk

AI Trainer & Production-Grade Data Labeling Specialist

India flagKERALA, India
$20.00/hrEntry LevelAws SagemakerAppenGoogle Cloud Vertex AI

Key Skills

Software

AWS SageMakerAWS SageMaker
AppenAppen
Google Cloud Vertex AIGoogle Cloud Vertex AI
Label StudioLabel Studio
OpenCV AI Kit (OAK)OpenCV AI Kit (OAK)
RoboflowRoboflow
Internal/Proprietary Tooling

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
ImageImage
TextText

Top Task Types

Bounding Box
Classification
Computer Programming Coding
Entity Ner Classification
Text Summarization

Freelancer Overview

I am an AI Trainer and Data Labeling Specialist with hands-on experience in text, image, and document annotation from both academic and freelance projects. During my AI internship, I worked on real-world systems including face recognition, document classification, and legal document matching, where I contributed to dataset creation, labeling, cleaning, and validation for model training. I am experienced with tools like Label Studio and cloud-based platforms, and I bring strong Python fundamentals with a practical understanding of NLP and computer vision workflows. I focus on producing clean, consistent training data that directly improves model performance in production environments.

Entry LevelHindiArabicEnglishMalayalam

Labeling Experience

Label Studio

Audio Transcription & Text Data Labeling Project

Label StudioTextEntity Ner ClassificationClassification
Worked on audio-to-text data preparation for AI training by first transcribing raw audio files into clean text format and then performing structured labeling using Label Studio. Tasks included text classification, entity tagging, and content segmentation to prepare high-quality datasets for downstream NLP model training. Also performed data cleaning and validation to ensure transcription accuracy, consistency, and usability for speech and language models.

Worked on audio-to-text data preparation for AI training by first transcribing raw audio files into clean text format and then performing structured labeling using Label Studio. Tasks included text classification, entity tagging, and content segmentation to prepare high-quality datasets for downstream NLP model training. Also performed data cleaning and validation to ensure transcription accuracy, consistency, and usability for speech and language models.

2025
Roboflow

Face Recognition Dataset Labeling for CCTV Surveillance

RoboflowImageBounding BoxClassification
Labeled face image data for a real-time CCTV-based face recognition system during my AI internship. Performed image annotation using bounding boxes and identity/class labels to build a clean training dataset. Supported preprocessing, dataset validation, and testing of recognition accuracy in real-world lighting and camera conditions. The labeled data was used to train and evaluate a deep learning–based face recognition model for attendance and security use cases.

Labeled face image data for a real-time CCTV-based face recognition system during my AI internship. Performed image annotation using bounding boxes and identity/class labels to build a clean training dataset. Supported preprocessing, dataset validation, and testing of recognition accuracy in real-world lighting and camera conditions. The labeled data was used to train and evaluate a deep learning–based face recognition model for attendance and security use cases.

2025
Label Studio

Legal Document Classification & Matching System

Label StudioImageBounding BoxEntity Ner Classification
Worked on a real-world legal document automation system during my AI internship. Performed large-scale text and document labeling using Label Studio for court records including petitions, orders, and case files. Tasks included document classification, entity tagging, and matching related legal documents for downstream model training using Layout-aware NLP pipelines. Also supported data cleaning, validation, and analysis of model predictions to improve accuracy and consistency of the document understanding system.

Worked on a real-world legal document automation system during my AI internship. Performed large-scale text and document labeling using Label Studio for court records including petitions, orders, and case files. Tasks included document classification, entity tagging, and matching related legal documents for downstream model training using Layout-aware NLP pipelines. Also supported data cleaning, validation, and analysis of model predictions to improve accuracy and consistency of the document understanding system.

2025

Education

D

Digital University Kerala

Master of Science, Computer Science

Master of Science
2023 - 2025
F

Farook College, Calicut University

Bachelor of Science, Mathematics

Bachelor of Science
2020 - 2023

Work History

D

DIGITAL UNIVERSITY OF KERALA

AI INTERN

Thiruvananthapuram
2025 - Present
A

ASSET ACADEMY

TEACHER

Malappuram
2022 - 2023