Tracing Uncertainty in Language Model "Reasoning"
概要
arXiv:2605.07776v1 Announce Type: cross Abstract: Language model (LM) "reasoning", commonly described as Chain-of-Thought or test-time scaling, often improves benchmark performance, but the dynamics underlying this process remain poorly understood. We study these dynamics through the lens of uncert…