LLM Data Labeling and Model Training for RAG Systems (AI Engineer at NeQabty)
I extracted, cleaned, and prepared photo and document-derived text data for use in retrieval-augmented generation (RAG) systems. I deployed and fine-tuned multiple LLM-based models using processed data to support AI retrieval tasks. Responsibilities included crafting representative datasets and ensuring input quality for model performance. • Managed document and photo-derived text annotation and data labeling for LLM-based Azure OpenAI RAG systems. • Created and maintained custom text datasets to enhance model training and retrieval relevance. • Used Azure AI Foundry, LangChain, and internal tools for data processing and labeling workflows. • Focused on business document understanding and automation within RAG pipelines.