Integrated and fine-tuned LLMs with focus on training data quality and pipeline optimization
I collaborated closely with data science teams to integrate, fine-tune, and optimize large language models (LLMs) by preparing and curating relevant datasets. My responsibilities included ensuring data quality for model training, facilitating machine learning best practices, and supporting regular updates to AI systems. I frequently leveraged tools and cloud-based solutions to manage the data and streamline the training pipeline.• Integrated and prepared text data for machine learning model training and fine-tuning. • Worked on ensuring high-quality, diverse, and relevant datasets for LLM evaluation and updates. • Contributed to model versioning, deployment, and ongoing quality improvement initiatives. • Utilized AWS SageMaker and Internal/Proprietary Tooling for data management and training workflows.