Fellow @ Handshake AI
In this role, I collaborated with researchers to improve the performance of large language models (LLMs) on tasks involving psychology and English-language reasoning. My primary responsibility was the testing and evaluation of model outputs to identify contextual errors and propose targeted improvements. I regularly assessed the accuracy and relevance of AI-generated responses within my areas of expertise.
• Conducted critical analysis to spot subtle errors in LLM output.
• Provided targeted feedback to enhance language model performance.
• Utilized subject matter expertise in psychology and English literature.
• Supported iterative improvements to LLM training processes.