EgoPro-Bench: Benchmarking Personalized Proactive Interaction in Egocentric Video Streams
概要
arXiv:2605.07299v1 Announce Type: cross Abstract: Existing Multimodal Large Language Models (MLLMs) remain primarily reactive, failing to continuously perceive environments or proactively assist users. While emerging benchmarks address proactivity, they are largely confined to alert scenarios, negl…