Visual Grounding & Referring Expressions
Categorized and annotated visual grounding tasks involving referring expression generation for AI model training. Identified and labeled challenge types including Similar-looking Neighbours, Fine Boundaries, Occlusion, and Low Contrast scenes. Evaluated model outputs for spatial accuracy and linguistic grounding quality across large image datasets.