CAMEL: Confidence-Gated Reflection for Reward Modeling
Summary
arXiv:2602.20670v2
Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: scalar discriminative preference models, which are efficient but lack interpretability, and …