Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising
概要
arXiv:2604.26694v2 Announce Type: replace-cross Abstract: We propose X-WAM, a Unified 4D World Model that unifies real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critical limitations of prior unified world mod…