For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
K

Kezia Wahome

AI Data Preparation and LLM Data Parsing

KENYA flag
Nairobi, Kenya
$18.00/hrIntermediateAws SagemakerData Annotation TechLabelbox

Key Skills

Software

AWS SageMakerAWS SageMaker
Data Annotation TechData Annotation Tech
LabelboxLabelbox
Label StudioLabel Studio
Scale AIScale AI
Axiom AI

Top Subject Matter

Large Language Models
Big Data Cloud Support Engineer - Amazon Sagemaker Expert

Top Data Types

TextText
Geospatial Tiled ImageryGeospatial Tiled Imagery

Top Task Types

Text Generation
Object Detection
Fine Tuning
Prompt Response Writing SFT
Bounding Box
Classification
Function Calling

Freelancer Overview

AI Data Preparation and LLM Log Parsing Pipeline (IBM Research). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science with Honours, African Leadership University (2024) and Certificate, Carnegie Mellon University - Africa (2023). AI-training focus includes data types such as Text and labeling workflows including Text Generation.

IntermediateEnglish

Labeling Experience

AI Data Preparation and LLM Log Parsing Pipeline (IBM Research)

TextText Generation
I participated in building an LLM automated log parsing pipeline for AI model evaluation. My work involved preparing structured textual log datasets and prompting strategies to enhance log-parsing performance. I benchmarked LLM-based parsing against state-of-the-art tools and collaborated with research scientists for production integration. • Conducted large-scale preprocessing of 33M+ Spark logs for effective LLM evaluation. • Designed multiple prompt generation strategies, including KNN retrieval of sample data. • Evaluated models through ablation studies and log dataset annotation for accuracy comparison. • Benchmarked results against tools like Drain, Spell, SPINE, LogPPT, and ChatGPT.

I participated in building an LLM automated log parsing pipeline for AI model evaluation. My work involved preparing structured textual log datasets and prompting strategies to enhance log-parsing performance. I benchmarked LLM-based parsing against state-of-the-art tools and collaborated with research scientists for production integration. • Conducted large-scale preprocessing of 33M+ Spark logs for effective LLM evaluation. • Designed multiple prompt generation strategies, including KNN retrieval of sample data. • Evaluated models through ablation studies and log dataset annotation for accuracy comparison. • Benchmarked results against tools like Drain, Spell, SPINE, LogPPT, and ChatGPT.

2024 - 2024

Education

A

African Leadership University

Bachelor of Science with Honours, Computer Science

Bachelor of Science with Honours
2021 - 2024
C

Carnegie Mellon University - Africa

Certificate, General Studies in Technology and Research

Certificate
2023 - 2023

Work History

A

Amazon

Cloud Support Engineer

Nairobi
2025 - Present
I

IBM

Software Engineer Intern

Nairobi
2024 - 2024