James Masila - AI/Data Scientist – LLM Fine-Tuning & Data Preparation

Key Skills

Software

Data Annotation Tech

iMerit

Micro1

Remotasks

Axiom AI

Top Subject Matter

Large Language Models

Domain-specific Text

Multimodal Scanned Documents

Top Data Types

Text

Document

Image

Top Task Types

Fine Tuning

Classification

Freelancer Overview

AI/Data Scientist specializing in LLM fine-tuning and data preparation, with 7+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Education includes a Doctor of Philosophy, University of Nairobi (2022) and a Bachelor of Science, Meru University of Science and Technology (2020). AI-training work covers Text and Document data types and labeling workflows including fine-tuning and classification.

Intermediate English

Labeling Experience

Multimodal AI Document Understanding – Data Annotation

Document, Classification

I built multimodal document understanding systems by labeling scanned document data for classification and automated extraction. Tasks included annotating and validating both text and image content for the development of OCR and vision-based models. I combined manual annotation with automated pipeline approaches for efficient processing.
• Labeled and validated documents for OCR model training.
• Annotated multi-format content for text-image understanding models.
• Contributed to dataset curation for document classification and extraction.
• Collaborated with team members to verify annotation consistency and accuracy.

2022 - Present

AI/Data Scientist – LLM Fine-Tuning & Data Preparation

Text, Fine Tuning

I fine-tuned transformer-based language models for domain-specific tasks by curating and annotating datasets. This included preparing text corpora, designing prompt-response pairs, and configuring parameter-efficient fine-tuning workflows. Evaluation frameworks were integrated to ensure model reliability and minimize hallucinations.
• Built scalable pipelines for text data annotation and LLM fine-tuning.
• Implemented prompt engineering and data augmentation for better model adaptation.
• Optimized dataset quality and relevance for supervised fine-tuning and RLHF.
• Integrated evaluation and safety monitoring tools throughout the training process.

2022 - Present

Education

Meru University of Science and Technology

Bachelor of Science, Computer Science

2016 - 2020

Work History

N/A

AI/Data Scientist – Generative AI Systems

Nairobi

2022 - Present

N/A

Machine Learning Engineer

Nairobi

2020 - 2021