The Endogeneity of Miscalibration: Impossibility and Escape in Scored Reporting
概要
arXiv:2605.07671v1 Announce Type: cross Abstract: Eliciting truthful reports from autonomous agents is a core problem in scalable AI oversight: a principal scores the agent's report using a strictly proper scoring rule, but the agent also benefits from the report through a non-accuracy channel (app…