arXiv cs.AI by Synapse Flow 編集部

MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents

概要

arXiv:2605.03952v1 Announce Type: cross Abstract: Coding agents often pass per-prompt safety review yet ship exploitable code when their tasks are decomposed into routine engineering tickets. The challenge is structural: existing safety alignment evaluates overt requests in isolation, leaving model…

元記事を読む →

関連記事