AI Generalist | LLM Evaluation & Training
I worked on large language model (LLM) evaluation and training, focusing on the quality of training data and the assessment of model outputs. My work involved annotating AI-generated responses for accuracy, coherence, and logical consistency in both Spanish and English. I also contributed to refining structured question–answer datasets for multilingual AI improvement.
• Evaluated and rated LLM outputs for contextual accuracy
• Created and refined structured question–answer training data
• Conducted source validation and logical consistency checks
• Identified edge cases and inconsistencies to optimize model performance