Calori Musi - ML Research & Applied AI – Outlier AI (Google Brain)

Key Skills

Software

CVAT

Mindrift

Top Subject Matter

Self-supervised learning

tabular data

text classification

Top Data Types

Text

Document

Top Task Types

Fine-tuning

Classification

Data Collection

Freelancer Overview

ML Research & Applied AI – Outlier AI (Google Brain). Brings 4+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, Open University of Kenya (2023). AI-training focus includes data types such as Text and labeling workflows including Fine-tuning.

ExpertEnglish

Labeling Experience

ML Research & Applied AI – Outlier AI (Google Brain)

TextFine Tuning

Participated in fine-tuning large language models (LLMs) and developing self-supervised learning mechanisms for structured tabular data. Co-developed an internal variant of the SAINT architecture, focused on efficient representation learning for text-based classification tasks. Contributed to model deployments and benchmarking in real-world environments.• Led model training data pipeline construction and validation processes.• Ensured dataset quality and diverse coverage for LLM fine-tuning.• Conducted performance evaluation and participated in peer-reviewed research dissemination.• Supported productionization of models involving labeled data for Google Shopping ranking tasks.

2024 - 2025

Enterprise RAG Platform for Legal Document Intelligence (Project)

TextFine Tuning

Fine-tuned LLaMA-2-13B on domain-specific legal text corpora for Retrieval-Augmented Generation (RAG) systems. Prepared and curated labeled data for supervised learning to improve legal document question answering accuracy. Evaluated outputs against benchmark datasets and iteratively refined model training data.• Developed datasets through careful prompt-response curation and annotation.• Established quality control protocols for annotated legal texts.• Engaged in task-specific RAG evaluation using labeled ground-truth answers.• Facilitated transfer of legal AI systems from prototype to production environments.

2024 - 2024

Education

M

Moringa school

certificate, data science

certificate

2025 - 2025

O

Open University of Kenya

Bachelor of Science, Data Science

Bachelor of Science

2020 - 2023

Work History

F

Fortitude Systems

Data Scientist & Machine Learning Technician

San Francisco

2025 - Present

O

Outlier AI

Data Scientist - Applied Machine Learning

N/A

2024 - 2025