Multimodal Fact-Level Attribution for Verifiable Reasoning
概要
arXiv:2602.11509v2 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) are increasingly used for real-world tasks involving multi-step reasoning and long-form generation, where reliability requires grounding model outputs in heterogeneous input sources and verifying indi…