Biology Prompt Generation and Model Evaluation
In this project, I generated biology-related question prompts to evaluate the performance of AI language models in scientific reasoning. For each prompt, I analyzed the model’s response to determine its accuracy and coherence. If the answer was incorrect or logically flawed, I identified the point where the reasoning broke down and provided a corrected explanation grounded in verified biological facts.
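The review workflow described above could be recorded in a structure like the following Python sketch. This is an illustration under stated assumptions, not the tooling used in the project: the class names, the field names, and the sample Krebs cycle entry are all hypothetical, and the per-step judgments are assumed to come from a human reviewer rather than from code.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ReasoningStep:
    """One step of a model's answer, as split out by the reviewer (hypothetical type)."""
    text: str
    is_sound: bool  # reviewer's judgment of this step, entered by hand


@dataclass
class Evaluation:
    """One reviewed prompt/response pair (hypothetical type)."""
    prompt: str
    steps: List[ReasoningStep]
    final_answer_correct: bool
    corrected_explanation: Optional[str] = None  # filled in only when a flaw is found

    def first_flawed_step(self) -> Optional[int]:
        """Return the 0-based index of the first unsound step, or None if all are sound."""
        for i, step in enumerate(self.steps):
            if not step.is_sound:
                return i
        return None

    def needs_correction(self) -> bool:
        """True if the final answer is wrong or any reasoning step is unsound."""
        return (not self.final_answer_correct) or self.first_flawed_step() is not None


# Made-up sample entry for illustration only; not an actual result from the project.
example = Evaluation(
    prompt="Where in a eukaryotic cell does the Krebs cycle take place?",
    steps=[
        ReasoningStep("The Krebs cycle is part of aerobic respiration.", True),
        ReasoningStep("Aerobic respiration takes place entirely in the cytoplasm.", False),
        ReasoningStep("So the Krebs cycle occurs in the cytoplasm.", False),
    ],
    final_answer_correct=False,
    corrected_explanation=(
        "Glycolysis occurs in the cytosol, but the Krebs (citric acid) cycle "
        "takes place in the mitochondrial matrix."
    ),
)

if __name__ == "__main__":
    idx = example.first_flawed_step()
    if idx is not None:
        print(f"Reasoning breaks down at step {idx + 1}: {example.steps[idx].text}")
        print(f"Correction: {example.corrected_explanation}")
```

Storing the reasoning as separate steps makes it possible to point to where an answer first goes wrong, instead of only marking the final answer as right or wrong.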