Charan Konduru - AI Data Annotator

Key Skills

Software

Other

AWS SageMaker

HiveMind

Google Cloud Vertex AI

Mercor

Top Subject Matter

Toxic Comment Detection

Nlp Domain Expertise

Social Media Moderation

Top Data Types

Text

Image

Audio

Top Task Types

Classification

Text Generation

Object Detection

Fine Tuning

Transcription

Data Collection

Computer Programming Coding

Function Calling

Evaluation Rating

Text Summarization

Segmentation

Entity Ner Classification

Point Key Point

Freelancer Overview

I have experience working with data labeling and AI training workflows through hands-on projects in machine learning and NLP. In my toxic comment classification project, I worked extensively on preparing and cleaning text data, handling class imbalance, and ensuring high-quality labeled datasets for model training. This included tasks like preprocessing noisy user-generated content, validating labels, and structuring datasets for supervised learning. I also gained exposure to evaluation processes by comparing multiple models and analyzing performance metrics such as precision, recall, and F1-score, which are critical in assessing label quality and model reliability. Additionally, I have experience building data pipelines and working with large-scale datasets using Python, SQL, and AWS tools like Glue and Athena. In my NYC Taxi Demand Prediction project, I engineered features, validated data consistency, and ensured data quality before training models like LightGBM. My background in data engineering and analytics, combined with an understanding of machine learning workflows, allows me to contribute effectively to AI training tasks that require attention to detail, structured thinking, and high-quality data annotation.

IntermediateEnglish

Labeling Experience

Toxic Comment Classification AI Training & Annotation

OtherTextClassification

Developed a multi-label classification system to identify toxic comments in a large dataset of online text. Leveraged advanced transformer models (BERT, DeBERTa-v3, LLaMA-2) to annotate and classify 160K+ comments by toxicity type and severity. Applied Multi-Label SMOTE to address extreme class imbalance and enhance rare class detection. • Built label definition and annotation schema for toxicity detection, including rare categories. • Implemented manual review and evaluation of automatic toxic comment predictions. • Tuned models based on labeling outcomes to increase minority class F1-scores. • Facilitated QA and correction of misclassified or ambiguous toxic comments.

2024 - 2024

Education

G

Gayatri Vidya Parishad College of Engineering

Bachelor of Technology, Computer Science

Bachelor of Technology

2020 - 2024

S

State University of New York at Buffalo

Master of Science, Data Science

Master of Science

2024

Work History

I

International Sibling Society

Data Science Intern

Las Vegas

2026 - Present

R

Rashtriya Ispat Nigam Limited

Data Scientist

Visakhapatnam

2023 - 2024