For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
D

Debojyoti Bhuinya

Prompt Engineering and LLM Fine-Tuning for Q&A Accuracy

India flagKolkata, India
$20.00/hrIntermediateOneformaAws SagemakerData Annotation Tech

Key Skills

Software

OneFormaOneForma
AWS SageMakerAWS SageMaker
Data Annotation TechData Annotation Tech
iMeritiMerit
RemotasksRemotasks
SuperAnnotateSuperAnnotate

Top Subject Matter

Education/tutoring Domain Expertise
Domain-Specific Slides
Educational Content/Exam Preparation

Top Data Types

TextText
ImageImage

Top Task Types

Fine-tuningFine-tuning
Text GenerationText Generation
ClassificationClassification

Freelancer Overview

Prompt Engineering and LLM Fine-Tuning for Q&A Accuracy. Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Brainware University (2025). AI-training focus includes data types such as Text and Image and labeling workflows including Fine-tuning, Text Generation, and Classification.

IntermediateEnglishBengaliHindi

Labeling Experience

Prompt Engineering and LLM Fine-Tuning for Q&A Accuracy

TextFine Tuning
I performed prompt engineering and LLM fine-tuning to improve Q&A accuracy for domain-specific slide content on SlideCoach's AI-driven tutoring platform. My work centered on preparing and curating textual datasets for model training and validation in real-world education contexts. I used AWS Bedrock and SageMaker as the main AI infrastructure throughout the project. • Designed and applied prompts for LLM performance improvement. • Selected, organized, and augmented text-based datasets for supervised fine-tuning and evaluation. • Monitored and evaluated model outputs to enhance answer relevance and correctness. • Used internal/proprietary tooling integrated with AWS services for data ingestion and model feedback cycles.

I performed prompt engineering and LLM fine-tuning to improve Q&A accuracy for domain-specific slide content on SlideCoach's AI-driven tutoring platform. My work centered on preparing and curating textual datasets for model training and validation in real-world education contexts. I used AWS Bedrock and SageMaker as the main AI infrastructure throughout the project. • Designed and applied prompts for LLM performance improvement. • Selected, organized, and augmented text-based datasets for supervised fine-tuning and evaluation. • Monitored and evaluated model outputs to enhance answer relevance and correctness. • Used internal/proprietary tooling integrated with AWS services for data ingestion and model feedback cycles.

2025 - Present

Dataset Generation and Annotation for Conversational AI

TextText Generation
I developed an AI video generation service and conversational AI that automated exam preparation content creation using structured PDF documents. This involved generating training data for context-aware models and preparing datasets for downstream NLP tasks. I primarily used AWS Bedrock, ChromaDB, and custom cloud pipelines for annotation and content validation. • Generated question-answer pairs and structured summaries from educational PDF documents. • Labeled and organized textual data to support conversational AI and RAG system training. • Implemented and validated data augmentation techniques to increase dataset diversity. • Evaluated the quality and contextual fit of generated texts in end-user study scenarios.

I developed an AI video generation service and conversational AI that automated exam preparation content creation using structured PDF documents. This involved generating training data for context-aware models and preparing datasets for downstream NLP tasks. I primarily used AWS Bedrock, ChromaDB, and custom cloud pipelines for annotation and content validation. • Generated question-answer pairs and structured summaries from educational PDF documents. • Labeled and organized textual data to support conversational AI and RAG system training. • Implemented and validated data augmentation techniques to increase dataset diversity. • Evaluated the quality and contextual fit of generated texts in end-user study scenarios.

2025 - 2025

OCR Dataset Annotation for Bengali Handwriting

ImageClassification
I built and annotated a custom dataset of Bengali handwritten text images to develop an OCR system using EfficientNet architecture. The data labeling process involved manually categorizing and verifying image labels for complex handwritten scripts in the Bengali language. I utilized proprietary scripts and manual curation for dataset cleaning and validation. • Collected and labeled hundreds of images for supervised learning and model validation. • Verified class labels against ground truth for robust benchmarking. • Preprocessed images to align, de-skew, and normalize handwritten words. • Assisted in evaluating model recognition accuracy on labeled data.

I built and annotated a custom dataset of Bengali handwritten text images to develop an OCR system using EfficientNet architecture. The data labeling process involved manually categorizing and verifying image labels for complex handwritten scripts in the Bengali language. I utilized proprietary scripts and manual curation for dataset cleaning and validation. • Collected and labeled hundreds of images for supervised learning and model validation. • Verified class labels against ground truth for robust benchmarking. • Preprocessed images to align, de-skew, and normalize handwritten words. • Assisted in evaluating model recognition accuracy on labeled data.

2023 - 2024

Education

B

Brainware University

Bachelor of Technology, Computer Science (Artificial Intelligence and Machine Learning)

Bachelor of Technology
2021 - 2025

Work History

S

SlideCoach

Machine Learning Engineer

Kolkata
2025 - Present
C

Campus Ready

AI/ML Tech Intern

Kolkata
2025 - 2025