AI Data Expert (Contract) – LILT
I evaluated AI model outputs for linguistic accuracy, reasoning quality, and contextual correctness. I produced high-quality annotated datasets used to train and refine large language models. I identified model weaknesses and edge cases and provided structured feedback to improve reliability.
• Evaluated model-generated text outputs for accuracy and relevance.
• Annotated datasets to support ongoing large language model improvements.
• Identified and reported edge cases and failure scenarios.
• Collaborated with software engineering teams to iterate on model enhancements.