Are Flat Minima an Illusion?
概要
arXiv:2605.05209v1 Announce Type: cross Abstract: Neural networks that land in flat regions of the loss landscape tend to generalise better than those in sharp regions. Sharpness-Aware Minimisation exploits this to improve generalisation. But function-preserving reparameterisation can inflate the H…