For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
I

Immanual Daniel

AI Model Response Validation and Evaluation (NVIDIA)

India flagBengaluru, India
$20.00/hrExpertOneformaMindriftOpencv AI Kit Oak

Key Skills

Software

OneFormaOneForma
MindriftMindrift
OpenCV AI Kit (OAK)OpenCV AI Kit (OAK)

Top Subject Matter

AI-generated Text Validation
LLM Guardrails
Compliance Testing

Top Data Types

TextText
DocumentDocument

Top Task Types

Prompt Response Writing SFT

Freelancer Overview

AI Model Response Validation and Evaluation (NVIDIA). Brings 8+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Computer Applications, East West College of Management (2018) and Bachelor of Computer Applications, Vivekananda Degree College (2015). AI-training focus includes data types such as Text and labeling workflows including Evaluation, Rating, and Prompt + Response Writing (SFT).

ExpertEnglish

Labeling Experience

RAG Pipeline Output Evaluation (CSA Agent Project)

Text
As part of the CSA Agent AI Customer Support Automation project, I evaluated the contextual relevance and traceability of LLM-generated responses. I participated in the systematic testing of Retrieval Augmented Generation (RAG) pipelines, verifying accuracy and appropriateness of results. My focus included the identification and flagging of mismatches between AI-generated outputs and expected customer support resolutions. • Performed RAG-specific validation of conversational customer support agents. • Assessed LLM response quality for real-world customer support scenarios. • Collaborated on process improvements for RAG pipeline output review. • Implemented structured review and annotation of support dialogues.

As part of the CSA Agent AI Customer Support Automation project, I evaluated the contextual relevance and traceability of LLM-generated responses. I participated in the systematic testing of Retrieval Augmented Generation (RAG) pipelines, verifying accuracy and appropriateness of results. My focus included the identification and flagging of mismatches between AI-generated outputs and expected customer support resolutions. • Performed RAG-specific validation of conversational customer support agents. • Assessed LLM response quality for real-world customer support scenarios. • Collaborated on process improvements for RAG pipeline output review. • Implemented structured review and annotation of support dialogues.

2022 - Present

Prompt Validation and Optimization (NVIDIA/FIN Bot AI Project)

TextPrompt Response Writing SFT
In the role of QA Engineer, I actively participated in LLM workflow evaluation that involved prompt validation and curation. I constructed, reviewed, and refined prompts and corresponding responses to optimize AI agent performance. My responsibilities included subjective and objective rating of generated text and improvement of prompt tuning methods. • Developed and maintained a comprehensive suite for prompt-response validation. • Applied domain expertise to enhance context accuracy in AI-generated content. • Monitored observability metrics to inform prompt adjustments and reduce model error rates. • Enhanced feedback loops between test generation and LLM behavior in production.

In the role of QA Engineer, I actively participated in LLM workflow evaluation that involved prompt validation and curation. I constructed, reviewed, and refined prompts and corresponding responses to optimize AI agent performance. My responsibilities included subjective and objective rating of generated text and improvement of prompt tuning methods. • Developed and maintained a comprehensive suite for prompt-response validation. • Applied domain expertise to enhance context accuracy in AI-generated content. • Monitored observability metrics to inform prompt adjustments and reduce model error rates. • Enhanced feedback loops between test generation and LLM behavior in production.

2022 - Present

AI Model Response Validation and Evaluation (NVIDIA)

Text
As a QA Engineer working with NVIDIA, I implemented AI response validation and guardrail testing strategies for large language model (LLM) workflows. I evaluated AI model outputs, assessed prompt engineering effectiveness, and validated the reliability and accuracy of generative AI-based systems. Additionally, I performed structured audits of model responses to ensure risk mitigation and compliance to regulatory standards. • Led prompt evaluation and RAG validation processes for enterprise AI systems. • Designed and executed tests targeting hallucination risks and compliance failures. • Used Python-based internal automation tools for AI model result evaluation in CI/CD pipelines. • Collaborated with developers to integrate structured LLM output validation into automated test frameworks.

As a QA Engineer working with NVIDIA, I implemented AI response validation and guardrail testing strategies for large language model (LLM) workflows. I evaluated AI model outputs, assessed prompt engineering effectiveness, and validated the reliability and accuracy of generative AI-based systems. Additionally, I performed structured audits of model responses to ensure risk mitigation and compliance to regulatory standards. • Led prompt evaluation and RAG validation processes for enterprise AI systems. • Designed and executed tests targeting hallucination risks and compliance failures. • Used Python-based internal automation tools for AI model result evaluation in CI/CD pipelines. • Collaborated with developers to integrate structured LLM output validation into automated test frameworks.

2022 - Present

Education

E

East West College of Management

Master of Computer Applications, Computer Applications

Master of Computer Applications
2018 - 2018
V

Vivekananda Degree College

Bachelor of Computer Applications, Computer Applications

Bachelor of Computer Applications
2015 - 2015

Work History

K

Kloc Technologies

QA Engineer

Bengaluru
2022 - Present
H

HR Mantra Software

QA Engineer

Bengaluru
2020 - 2022