Multimodal AI Evaluation
Worked on human preference ranking workflows for multimodal generative AI systems, evaluating image and video outputs across dimensions such as prompt alignment, aesthetic quality, consistency, artifact detection, and content policy adherence. Contributed detailed comparative evaluations used to refine model performance and training datasets.