11 Nov 2025
Tucson, here we come (again)! Our paper “FG-TRACER” accepted at WACV 2026!
Multimodal Large Language Models (MLLMs) are powerful, but how do they really fuse vision and language?
Our paper “FG-TRACER: Tracing Information Flow in Multimodal Large Language Models in Free-Form Generation” has been accepted for presentation at WACV 2026 in Tucson, Arizona, USA. FG-TRACER introduces a framework for probing how information flows between modalities, revealing distinct model- and task-dependent fusion patterns in LLaMA 3.2-Vision and LLaVA 1.5 across TextVQA, COCO 2014, and ChartQA.