VISD: Enhancing Video Reasoning via Structured Self-Distillation
概要
arXiv:2605.06094v1 Announce Type: cross Abstract: Training VideoLLMs for complex reasoning remains challenging due to sparse sequence level rewards and the lack of fine grained credit assignment over long, temporally grounded reasoning trajectories. While reinforcement learning with verifiable rewa…