LLM Evaluation
In my time at Revelo, I was involved in AI training data flows that were centered around the assessment and optimization of LLM (large language model) output. Some of the duties I performed include checking for accuracy, coherence, adherence to instructions, and handling edge cases using a standardized set of metrics.