Senior Software Engineer (LLM Workflow/Data Labeling)
As Senior Software Engineer, I designed and deployed LLM-powered workflows for summarization and classification, leveraging prompt templates and RAG pipelines. These workflows involved the creation, curation, and evaluation of labeled datasets for high-precision answers over internal knowledge bases. Evaluation harnesses with golden sets, offline benchmarks, and structured scoring were implemented to ensure answer quality and reduce hallucinations.
• Designed, deployed, and iteratively improved LLM-based summarization/classification pipelines.
• Generated, enriched, and reviewed gold-standard labeled datasets for continuous AI evaluation.
• Created evaluation scripts and scorecards to benchmark and monitor ML output quality.
• Collaborated with product and engineering to translate use cases into labeling/evaluation tasks.