Remote Data Scientist (AI Training & Dataset Evaluation)
Developed AI training datasets and evaluation workflows to improve machine learning and LLM performance. Created, curated, and evaluated diverse text-based datasets for use in AI and language model development. Implemented processes for reviewing, scoring, and refining datasets to ensure quality and relevance. • Compiled datasets from disparate sources to suit client requirements • Designed protocols for evaluating LLM outputs • Integrated datasets with model training pipelines • Conducted ongoing dataset quality assurance and documentation