AI & Software Engineer — Remote
As an AI & Software Engineer, I was responsible for building and evaluating LLM systems with a focus on improving reasoning accuracy. I optimized datasets and worked to reduce hallucinations in AI models through iterative error analysis. My work included the development and implementation of structured data pipelines that supported efficient identification and correction of model errors. • Evaluated LLM outputs on reasoning tasks and performed structured error analysis. • Optimized and annotated text datasets for LLM training improvements. • Collaborated remotely on Project Diamond (Handshake AI), focusing on model evaluation. • Used internal/proprietary tools and Python scripting for annotation tasks.