PLOT: Progressive Localization via Optimal Transport in Neural Causal Abstraction
Summary
arXiv:2605.06979v1 (announce type: cross)
Abstract: Causal abstraction offers a principled framework for mechanistic interpretability, aligning a high-level causal model with the low-level computation realized by a neural network through counterfactual intervention analysis. Existing methods such as …