Prompting and Evaluating LLM conversations (general / coding / STEM-specific / math)
Prompting LLMs with a variety of tasks, including programming, STEM-specific, and mathematical problems. Evaluating the resulting conversations along the following dimensions:
- LLM answer accuracy
- LLM answer safety
- LLM answer verbosity and style
Providing expert opinion along these dimensions.