arXiv cs.AI by Synapse Flow 編集部

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

概要

arXiv:2605.05662v1 Announce Type: cross Abstract: Current LLM safety benchmarks are predominantly English-centric and often rely on translation, failing to capture country-specific harms. Moreover, they rarely evaluate a model's ability to detect culturally embedded sensitivities as distinct from u…

元記事を読む →

関連記事