VideoRouter: Query-Adaptive Dual Routing for Efficient Long-Video Understanding
概要
arXiv:2605.05848v1 Announce Type: cross Abstract: Video large multimodal models increasingly face a scalability bottleneck: long videos produce excessively long visual-token sequences, which sharply increase memory and latency during inference. While existing compression methods are effective in sp…