EA-WM: Event-Aware Generative World Model with Structured Kinematic-to-Visual Action Fields
概要
arXiv:2605.06192v1 Announce Type: cross Abstract: Pretrained video diffusion models provide powerful spatiotemporal generative priors, making them a natural foundation for robotic world models. While recent world-action models jointly optimize future videos and actions, they predominantly treat vid…