Sriya Pothula - AI Engineer Intern - RAG Pipeline and QA Labeling

Key Skills

Software

No software listed

Top Subject Matter

Mental Health Datasets / QA Systems

Clinical / Medical Domain QA Systems

Top Data Types

Text

Top Task Types

Question Answering

Freelancer Overview

AI Engineer Intern - RAG Pipeline and QA Labeling. Brings 5+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Science, George Mason University (2025) and Bachelor of Science, SRKR Engineering College (2020). AI-training focus includes data types such as Text and labeling workflows including Question Answering.

Intermediate

Labeling Experience

AI Engineer Intern - RAG Pipeline and QA Labeling

TextQuestion Answering

Built an end-to-end Retrieval-Augmented Generation (RAG) pipeline using embedding models and LLMs to deliver context-aware answers based on large-scale datasets. Focused on preprocessing, labeling, and curating mental health data to ensure high-quality model training and retrieval accuracy. Designed question-answering tasks requiring annotation, dataset curation, and embedding alignment for AI pipeline optimization. • Constructed embeddings for dataset chunks and aligned them with LLM outputs for QA tasks. • Performed QA annotation to improve data relevance in semantic search workflows. • Utilized Python and embedding models to generate labeled training/test sets for the RAG system. • Ensured dataset diversity and representativeness for accurate question answering.

2025 - 2025

Technical Project - Clinical QA Data Annotation

TextQuestion Answering

Developed a clinical question answering system leveraging transformer-based models (SBERT, ClinicalBERT, T5) for accurate medical responses. Curated, cleaned, and annotated clinical text datasets to train and evaluate semantic search and answer generation models. Labeled question-answer pairs to support model fine-tuning, evaluation, and error analysis tasks. • Organized data into labeled QA formats suitable for transformer architectures. • Generated question-answer pairs and conducted manual annotation for clinical accuracy. • Utilized FAISS for embedding storage and fast semantic retrieval in annotated data. • Evaluated answer quality by performing manual rating and error tagging of outputs.

Not specified

Education

G

George Mason University

Master of Science, Data Analytics Engineering

Master of Science

2024 - 2025

S

SRKR Engineering College

Bachelor of Science, Electronics and Communications Engineering

Bachelor of Science

2016 - 2020

Work History

T

Top Marble & Granite

Business Intelligence Intern

Fairfax

2025 - 2025

A

Alfa8

AI Engineer Intern

Fairfax

2025 - 2025