Where's the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Interventions
概要
arXiv:2605.07984v1 Announce Type: cross Abstract: We study planning site formation in language models -- where internal representations of structurally-constrained future tokens form during the forward pass, and whether they causally drive generation. Using rhyming-couplet completion as a clean tes…