Soft Deterministic Policy Gradient with Gaussian Smoothing
Summary
arXiv:2605.06228v1 (announce type: cross). Abstract: Deterministic policy gradient (DPG) is widely used for continuous control; however, its policy update inherently relies on the critic being differentiable with respect to the action. This assumption is violated in practical control pr…
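To make the differentiability requirement concrete, the standard deterministic policy gradient can be written out as below. The notation is an assumption for illustration, since the truncated abstract defines no symbols: $\mu_\theta$ is the deterministic policy, $Q^{\mu}$ the critic, and $\rho^{\mu}$ the state distribution. The second pair of lines is the generic Gaussian-smoothing identity that the title alludes to, with a hypothetical smoothing scale $\sigma$; it is a sketch of the general technique and may differ from the estimator the paper actually proposes.

```latex
% Standard deterministic policy gradient: the update needs \nabla_a Q to exist.
\nabla_\theta J(\mu_\theta)
  = \mathbb{E}_{s \sim \rho^{\mu}}
    \left[ \nabla_\theta \mu_\theta(s)\,
           \nabla_a Q^{\mu}(s, a) \big|_{a = \mu_\theta(s)} \right]

% Generic Gaussian smoothing of the critic (assumed notation, scale \sigma > 0):
Q_\sigma(s, a)
  = \mathbb{E}_{\epsilon \sim \mathcal{N}(0, I)}
    \left[ Q(s, a + \sigma \epsilon) \right]

% Its action gradient needs only evaluations of Q, not derivatives of Q:
\nabla_a Q_\sigma(s, a)
  = \frac{1}{\sigma}\,
    \mathbb{E}_{\epsilon \sim \mathcal{N}(0, I)}
    \left[ \epsilon \, Q(s, a + \sigma \epsilon) \right]
```

If $Q^{\mu}$ is piecewise constant in the action, for example, $\nabla_a Q^{\mu}$ is zero almost everywhere and undefined at the jumps, so the first expression carries no learning signal, whereas the smoothed gradient remains well defined.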