For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
O
Olanrewaju Odeajo

Olanrewaju Odeajo

AI Engineer – Data Preparation and LLM-Driven Document Transformation

Nigeria flagIbadan, Nigeria
$40.00/hrExpertAws SagemakerGoogle Cloud Vertex AIAxiom AI

Key Skills

Software

AWS SageMakerAWS SageMaker
Google Cloud Vertex AIGoogle Cloud Vertex AI
Axiom AI

Top Subject Matter

Policy Domain Expertise
Regulatory Compliance & Risk Analysis
LLM fine-tuning

Top Data Types

DocumentDocument
ImageImage
TextText

Top Task Types

Text GenerationText Generation
Entity (NER) ClassificationEntity (NER) Classification
Fine-tuningFine-tuning
RLHFRLHF
Evaluation/RatingEvaluation/Rating
Bounding BoxBounding Box

Freelancer Overview

AI Engineer – Data Preparation and LLM-Driven Document Transformation. Brings 8+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Master of Science, World Quant University (2026) and Post Graduate Diploma, N/A (2025). AI-training focus includes data types such as Document and labeling workflows including Text Generation and Entity (NER) Classification.

ExpertEnglish

Labeling Experience

AI Engineer – Data Preparation and LLM-Driven Document Transformation

DocumentText Generation
Engineered an end-to-end RAG pipeline automating the transformation of policy documents into unified formats suitable for LLM-driven analyses. Integrated OpenAI ada-002 embeddings, LangChain chunking, and Pinecone vector store to power semantic search and requirement matching. Enabled automated policy compliance gap analysis and large-scale document rewriting through LLMs for high coverage audit documentation. • Designed data transformation processes for over 100 JSON/PDF documents across compliance domains. • Developed pipelines for automated chunking and semantic embedding of documents. • Leveraged LLMs for text-based gap analysis and compliant document generation. • Conducted model evaluation to ensure high semantic retrieval accuracy.

Engineered an end-to-end RAG pipeline automating the transformation of policy documents into unified formats suitable for LLM-driven analyses. Integrated OpenAI ada-002 embeddings, LangChain chunking, and Pinecone vector store to power semantic search and requirement matching. Enabled automated policy compliance gap analysis and large-scale document rewriting through LLMs for high coverage audit documentation. • Designed data transformation processes for over 100 JSON/PDF documents across compliance domains. • Developed pipelines for automated chunking and semantic embedding of documents. • Leveraged LLMs for text-based gap analysis and compliant document generation. • Conducted model evaluation to ensure high semantic retrieval accuracy.

2021 - 2023

AI Engineer – Automated Document Data Extraction and Entity Labeling

DocumentEntity Ner Classification
Automated data extraction from over 2,000 invoices in varied formats using hybrid OCR and LLM pipelines. Built extraction systems to classify and accurately parse financial fields from noisy, unstructured documents. Validated structured outputs through schema compliance and accuracy assessment routines. • Integrated OCR (Tesseract) for document digitization and LLMs for text structuring. • Achieved over 95% field-level accuracy in entity recognition and extraction. • Applied deterministic techniques to reduce AI hallucination and enforce schema matches. • Compared cloud and local model deployments for secure and efficient annotation operations.

Automated data extraction from over 2,000 invoices in varied formats using hybrid OCR and LLM pipelines. Built extraction systems to classify and accurately parse financial fields from noisy, unstructured documents. Validated structured outputs through schema compliance and accuracy assessment routines. • Integrated OCR (Tesseract) for document digitization and LLMs for text structuring. • Achieved over 95% field-level accuracy in entity recognition and extraction. • Applied deterministic techniques to reduce AI hallucination and enforce schema matches. • Compared cloud and local model deployments for secure and efficient annotation operations.

2019 - 2021

Education

W

World Quant University

Master of Science, Financial Engineering

Master of Science
2024 - 2026
T

The Institute of Criminology and Strategic Studies

Post Graduate Diploma, Criminology and Strategies Studies

Post Graduate Diploma
2024 - 2025

Work History

G

Global Partnership for Sustainable Development Data

Data Science Fellow

N/A
2024 - Present
L

LoubbyAI

AI Engineer

N/A
2021 - 2023