Nandini Gourishetti

Backend Application Engineer

Bangalore, India
$8.00/hr
Entry Level
Scale AI

Key Skills

Software

Scale AI

Top Subject Matter

Programming
Finance
E-commerce

Top Data Types

Text
Image
Document

Top Task Types

Evaluation Rating
Prompt Response Writing SFT

Freelancer Overview

Backend Application Engineer with 1+ year of professional experience across complex workflows, research, and quality-focused execution. Holds a Bachelor of Engineering from Sreenidhi Institute of Science and Technology (2024).

Entry Level
Hindi
Telugu
English

Labeling Experience

MCP Tool Fine tuning

Text
Prompt Response Writing SFT
This project aims to improve LLM performance in MCP tool-calling tasks by using Reinforcement Learning with Verifiable Rewards (RLVR). It introduces a rubric-based reward system that provides detailed, multidimensional feedback for complex, multi-step reasoning. In this project, you will write a prompt that requires one or more tools to be fulfilled. You will then observe the trajectory the model uses to generate its response. Your goal is to rewrite the prompt until the model generates an incorrect response. Upon model failure, you will create a rubric that defines not only what an ideal response must contain but also the ideal trajectory the model must use to achieve that response. Your work will enhance the ability of cutting-edge LLMs to provide fitting and sophisticated answers to a diverse set of user prompts.

2025 - 2026

Outlier - AI Trainee

Text
Evaluation Rating
Compose a Stellar Prompt: Craft an engaging and clear prompt to get the conversation started.
Evaluate & Choose: Review 2 responses, select your preferred one, and justify your choice with thoughtful reasoning.
Keep the Conversation Alive: For many tasks (possibly not all!), continue the dialogue by submitting a related prompt and choosing the best response once again!

2025 - 2025

Education

Sreenidhi Institute of Science and Technology

Bachelor of Engineering, Computer Science and Engineering

2020 - 2024

Work History

Flipkart

Backend Application Engineer

Bangalore
2024 - Present