For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C

Calori Musi

ML Research & Applied AI – Outlier AI (Google Brain)

Kenya flagNairobi, Kenya
$20.00/hrExpertCVATMindrift

Key Skills

Software

CVATCVAT
MindriftMindrift

Top Subject Matter

Self-supervised learning
tabular data
text classification

Top Data Types

TextText
DocumentDocument

Top Task Types

Fine-tuningFine-tuning
ClassificationClassification
Data CollectionData Collection

Freelancer Overview

ML Research & Applied AI – Outlier AI (Google Brain). Brings 4+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, Open University of Kenya (2023). AI-training focus includes data types such as Text and labeling workflows including Fine-tuning.

ExpertEnglish

Labeling Experience

ML Research & Applied AI – Outlier AI (Google Brain)

TextFine Tuning
Participated in fine-tuning large language models (LLMs) and developing self-supervised learning mechanisms for structured tabular data. Co-developed an internal variant of the SAINT architecture, focused on efficient representation learning for text-based classification tasks. Contributed to model deployments and benchmarking in real-world environments.• Led model training data pipeline construction and validation processes.• Ensured dataset quality and diverse coverage for LLM fine-tuning.• Conducted performance evaluation and participated in peer-reviewed research dissemination.• Supported productionization of models involving labeled data for Google Shopping ranking tasks.

Participated in fine-tuning large language models (LLMs) and developing self-supervised learning mechanisms for structured tabular data. Co-developed an internal variant of the SAINT architecture, focused on efficient representation learning for text-based classification tasks. Contributed to model deployments and benchmarking in real-world environments.• Led model training data pipeline construction and validation processes.• Ensured dataset quality and diverse coverage for LLM fine-tuning.• Conducted performance evaluation and participated in peer-reviewed research dissemination.• Supported productionization of models involving labeled data for Google Shopping ranking tasks.

2024 - 2025

Enterprise RAG Platform for Legal Document Intelligence (Project)

TextFine Tuning
Fine-tuned LLaMA-2-13B on domain-specific legal text corpora for Retrieval-Augmented Generation (RAG) systems. Prepared and curated labeled data for supervised learning to improve legal document question answering accuracy. Evaluated outputs against benchmark datasets and iteratively refined model training data.• Developed datasets through careful prompt-response curation and annotation.• Established quality control protocols for annotated legal texts.• Engaged in task-specific RAG evaluation using labeled ground-truth answers.• Facilitated transfer of legal AI systems from prototype to production environments.

Fine-tuned LLaMA-2-13B on domain-specific legal text corpora for Retrieval-Augmented Generation (RAG) systems. Prepared and curated labeled data for supervised learning to improve legal document question answering accuracy. Evaluated outputs against benchmark datasets and iteratively refined model training data.• Developed datasets through careful prompt-response curation and annotation.• Established quality control protocols for annotated legal texts.• Engaged in task-specific RAG evaluation using labeled ground-truth answers.• Facilitated transfer of legal AI systems from prototype to production environments.

2024 - 2024

Education

M

Moringa school

certificate, data science

certificate
2025 - 2025
O

Open University of Kenya

Bachelor of Science, Data Science

Bachelor of Science
2020 - 2023

Work History

F

Fortitude Systems

Data Scientist & Machine Learning Technician

San Francisco
2025 - Present
O

Outlier AI

Data Scientist - Applied Machine Learning

N/A
2024 - 2025