Multi-Environment POMDPs with Finite-Horizon Objectives
Summary
arXiv:2605.07537v1. Partially Observable Markov Decision Processes (POMDPs) model a single agent that interacts with a stochastic environment and receives only partial information about the current state. In a multi-environment POMDP (MEPOMDP), the initial state is…