Inference Time Causal Probing in LLMs
概要
arXiv:2605.07631v1 Announce Type: new Abstract: Causal probing methods aim to test and control how internal representations influence the behavior of generative models. In causal probing, an intervention modifies hidden states so that a property takes on a different value. Most existing approaches …