For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
D

Deekshith Malleshaiah

LLM Evaluation & Robustness Engineer

United Kingdom flagLondon, United Kingdom
Entry Level

Key Skills

Software

No software listed

Top Subject Matter

LLM Evaluation and Robustness
Legal Services & Contract Review
Regulatory Compliance & Risk Analysis

Top Data Types

TextText
ImageImage
DocumentDocument

Top Task Types

No task types listed

Freelancer Overview

LLM Evaluation & Robustness Engineer. Brings 5+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Science, University of Aberdeen and Bachelor of Engineering, Visvesvaraya Technological University. AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

Entry Level

Labeling Experience

LLM Evaluation & Robustness Engineer

Text
Evaluated large language model outputs for correctness, reasoning, factual accuracy, and instruction-following criteria. Designed and executed structured evaluation pipelines to assess model performance and reliability. Provided comprehensive, high-quality feedback and explanations to enhance model and AI system efficacy. • Assessed model responses for safety, consistency, and domain fit. • Developed benchmarking protocols for reasoning and summarization tasks. • Identified model weaknesses including hallucinations and reasoning gaps. • Collaborated with teams to align model outputs with expected behaviors.

Evaluated large language model outputs for correctness, reasoning, factual accuracy, and instruction-following criteria. Designed and executed structured evaluation pipelines to assess model performance and reliability. Provided comprehensive, high-quality feedback and explanations to enhance model and AI system efficacy. • Assessed model responses for safety, consistency, and domain fit. • Developed benchmarking protocols for reasoning and summarization tasks. • Identified model weaknesses including hallucinations and reasoning gaps. • Collaborated with teams to align model outputs with expected behaviors.

2025 - 2025

Education

V

Visvesvaraya Technological University

Bachelor of Engineering, Civil Engineering

Bachelor of Engineering
Not specified
U

University of Aberdeen

Master of Science, Data Science

Master of Science
Not specified

Work History

M

Mind Storms

Lead AI Research Scientist

London
2024 - Present
C

Calm Care

Lead ML Engineer

New York
2025 - 2026