Lyapunov-Certified Direct Switching Theory for Q-Learning
Abstract
arXiv:2604.19569v3. Q-learning is a fundamental algorithmic primitive in reinforcement learning. This paper develops a new framework for analyzing Q-learning from a switching-system viewpoint. In particular, we derive a direct stochastic switching-system repres…
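
For readers who want the object of study in concrete form, here is a minimal sketch of the standard tabular Q-learning update. It is not the paper's switching-system representation, which the truncated abstract does not spell out. The function name `q_learning_step`, the step size `alpha`, the discount `gamma`, and the toy 2x2 table are all assumptions chosen for illustration. The sketch also returns the greedy action picked by the max operator. That selection changes as the table changes, and it is presumably the mode that a switching-system view tracks.

```python
import numpy as np

def q_learning_step(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update (illustrative sketch).

    Returns the updated table and the greedy action chosen at s_next.
    alpha and gamma are hypothetical defaults, not values from the paper.
    """
    Q = Q.copy()  # leave the caller's table untouched
    # The max over next-state actions selects a greedy action.
    greedy_action = int(np.argmax(Q[s_next]))
    td_target = r + gamma * Q[s_next, greedy_action]
    # Move Q(s, a) a fraction alpha toward the bootstrapped target.
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q, greedy_action

# Toy usage: 2 states, 2 actions, all-zero initial table.
Q0 = np.zeros((2, 2))
Q1, g = q_learning_step(Q0, s=0, a=1, r=1.0, s_next=1)
```

With an all-zero table the bootstrapped term vanishes, so the single update moves `Q1[0, 1]` toward the reward by a factor of `alpha`.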