Generative AI Specialist: LLM Evaluation and Code Refinement
Enhanced and evaluated premier Large Language Models, including Claude 3.7, Gemini, and Grok. My primary tasks involved refining model-generated text, correcting complex code, and developing ideal solutions across multiple languages (C, C++, Java, Python). I engineered sophisticated and adversarial prompts to test model limitations and improve robustness. Executed over 150 tasks across 18+ distinct projects, consistently adhering to strict quality measures to ensure the highest standards of accuracy and performance.