Jayvishal Shah - Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist

Key Skills

Software

Other

Don't disclose

Top Subject Matter

AI Safety

Red Teaming

Large Language Models

Top Data Types

Text

Video

Image

Top Task Types

Red Teaming

Data Collection

Classification

Freelancer Overview

Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist, with 2+ years of professional experience in complex workflows, research, and quality-focused execution. Tooling experience is primarily with internal, proprietary tools. Education: Bachelor of Science, Hemvati Nandan Bahuguna Garhwal University (2023). AI-training focus covers Text and Video data types and labeling workflows including Red Teaming, Data Collection, and Evaluation.

Expert: Hindi, English

Labeling Experience

Trusted Tester & Red Team Contributor

Don't disclose, Text

Contributed structured adversarial inputs and safety evaluation reports to trusted tester programs for frontier LLMs and multimodal AI models. Identified safety gaps in language, code, image, video, and speech systems to inform responsible scaling. Assessed generative systems pre-deployment and provided recommendations for policy compliance and robustness.
• Tested LLMs, code generation, and multimodal models for safety and fairness.
• Delivered structured feedback across text, image, speech, and video modalities.
• Identified vulnerabilities in diffusion and generative models before launch.
• Supported rollout of safety mitigations and harm reduction best practices.

2024 - Present

Operations Team Lead – Human Data; RLHF, SFT & Red Team Specialist

Text, Red Teaming

Led bilingual red teaming and evaluation projects to uncover LLM failure modes and improve model safety. Designed and managed end-to-end QA for high-volume adversarial prompt datasets, ensuring annotation consistency and accuracy. Resolved complex labeling issues and translated findings into systematic process improvements.
• Oversaw 100+ AI safety trainers in large-scale adversarial data annotation.
• Developed prompt libraries and structured evaluation pipelines for RLHF and SFT.
• Embedded human feedback to enhance model alignment and refusal behavior.
• Collaborated with researchers and operations to implement workflow improvements.

2024 - Present

AI-Powered Content Moderation System Developer

Video, Classification

Built and deployed a real-time content moderation system using multimodal models to classify user-uploaded videos and captions for policy enforcement. Developed automated pipelines for content review, flagging, and human escalation of edge cases. Continuously expanded the dataset to improve accuracy on adversarial and edge-case uploads.
• Engineered multi-tier video classification and moderation workflow.
• Applied CLIP and transformer models for joint video-text analysis.
• Managed dataset growth for alignment with evolving platform policies.
• Reduced false positives and improved edge-case detection.

2026 - 2026

Freelance Human Data Specialist

Other, Text, Data Collection

Curated and annotated unstructured email and meeting note datasets to train and validate AI assistants. Provided structured human feedback to refine email classification and retrieval system performance. Supported seamless integration of human-labeled data into engineering workflows.
• Labeled and reviewed datasets for an AI inbox assistant and meeting note retriever.
• Enhanced auto-reply and meeting information extraction accuracy.
• Proposed architecture for conversational memory models.
• Bridged data quality efforts between annotation and model deployment.

2025 - 2026

Education

Hemvati Nandan Bahuguna Garhwal University

Bachelor of Science, Physics, Chemistry and Mathematics

2020 - 2023

Work History

FyxerAI

Freelance Human Data Specialist

Remote

2025 - 2026