Software Engineer — LLM Data Generation
Generated data for software research and large language model (LLM) training, creating and collecting programming code samples to refine LLM capabilities. Supported research and product teams in compiling high-quality datasets for natural language and code understanding models.
• Generated and curated code examples for LLM fine-tuning and testing.
• Focused on Python and other software programming data for machine learning.
• Collaborated with AI researchers to identify optimal data collection strategies.
• Ensured data accuracy and compliance with project standards.