For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
J

Jayvishal Shah

Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist

INDIA flag
Remote, India
$10.00/hrExpertOtherDon T Disclose

Key Skills

Software

Other
Don't disclose

Top Subject Matter

AI Safety
Red Teaming
Large Language Models

Top Data Types

TextText
VideoVideo
ImageImage

Top Task Types

Red Teaming
Data Collection
Classification

Freelancer Overview

Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Other. Education includes Bachelor of Science, Hemvati Nandan Bahuguna Garhwal University (2023). AI-training focus includes data types such as Text and Video and labeling workflows including Red Teaming, Data Collection, and Evaluation.

ExpertHindiEnglish

Labeling Experience

Trusted Tester & Red Team Contributor

Don T DiscloseText
Contributed structured adversarial inputs and safety evaluation reports to trusted tester programs for frontier LLMs and multimodal AI models. Identified safety gaps in language, code, image, video, and speech systems to inform responsible scaling. Assessed generative systems pre-deployment and provided recommendations for policy compliance and robustness. • Tested LLMs, code generation, and multimodal models for safety and fairness. • Delivered structured feedback across text, image, speech, and video modalities. • Identified vulnerabilities in diffusion and generative models before launch. • Supported rollout of safety mitigations and harm reduction best practices.

Contributed structured adversarial inputs and safety evaluation reports to trusted tester programs for frontier LLMs and multimodal AI models. Identified safety gaps in language, code, image, video, and speech systems to inform responsible scaling. Assessed generative systems pre-deployment and provided recommendations for policy compliance and robustness. • Tested LLMs, code generation, and multimodal models for safety and fairness. • Delivered structured feedback across text, image, speech, and video modalities. • Identified vulnerabilities in diffusion and generative models before launch. • Supported rollout of safety mitigations and harm reduction best practices.

2024 - Present

Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist

TextRed Teaming
Led bilingual red teaming and evaluation projects to uncover LLM failure modes and improve model safety. Designed and managed end-to-end QA for high-volume adversarial prompt datasets, ensuring annotation consistency and accuracy. Resolved complex labeling issues and translated findings into systematic process improvements. • Oversaw 100+ AI safety trainers in large-scale adversarial data annotation. • Developed prompt libraries and structured evaluation pipelines for RLHF and SFT. • Embedded human feedback to enhance model alignment and refusal behavior. • Collaborated with researchers and operations to implement workflow improvements.

Led bilingual red teaming and evaluation projects to uncover LLM failure modes and improve model safety. Designed and managed end-to-end QA for high-volume adversarial prompt datasets, ensuring annotation consistency and accuracy. Resolved complex labeling issues and translated findings into systematic process improvements. • Oversaw 100+ AI safety trainers in large-scale adversarial data annotation. • Developed prompt libraries and structured evaluation pipelines for RLHF and SFT. • Embedded human feedback to enhance model alignment and refusal behavior. • Collaborated with researchers and operations to implement workflow improvements.

2024 - Present

AI-Powered Content Moderation System Developer

VideoClassification
Built and deployed a real-time content moderation system using multimodal models to classify user-uploaded videos and captions for policy enforcement. Developed automated pipelines for content review, flagging, and human escalation on edge cases. Maintained continuous dataset expansion for improving accuracy on adversarial and edge-case uploads. • Engineered multi-tier video classification and moderation workflow. • Applied CLIP and transformer models for joint video-text analysis. • Managed dataset growth for alignment with evolving platform policies. • Reduced false positives and improved edge-case detection.

Built and deployed a real-time content moderation system using multimodal models to classify user-uploaded videos and captions for policy enforcement. Developed automated pipelines for content review, flagging, and human escalation on edge cases. Maintained continuous dataset expansion for improving accuracy on adversarial and edge-case uploads. • Engineered multi-tier video classification and moderation workflow. • Applied CLIP and transformer models for joint video-text analysis. • Managed dataset growth for alignment with evolving platform policies. • Reduced false positives and improved edge-case detection.

2026 - 2026

Freelance Human Data Specialist

OtherTextData Collection
Curated and annotated unstructured email and meeting note datasets to train and validate AI assistants. Provided structured human feedback to refine email classification and retrieval system performance. Supported seamless integration of human labeled data into engineering workflows. • Labeled and reviewed datasets for an AI inbox assistant and meeting note retriever. • Enhanced auto-reply and meeting information extraction accuracy. • Proposed architecture for conversational memory models. • Bridged data quality efforts between annotation and model deployment.

Curated and annotated unstructured email and meeting note datasets to train and validate AI assistants. Provided structured human feedback to refine email classification and retrieval system performance. Supported seamless integration of human labeled data into engineering workflows. • Labeled and reviewed datasets for an AI inbox assistant and meeting note retriever. • Enhanced auto-reply and meeting information extraction accuracy. • Proposed architecture for conversational memory models. • Bridged data quality efforts between annotation and model deployment.

2025 - 2026

Education

H

Hemvati Nandan Bahuguna Garhwal University

Bachelor of Science, Physics, Chemistry and Mathematics

Bachelor of Science
2020 - 2023

Work History

F

FyxerAI

Freelance Human Data Specialist

Remote
2025 - 2026