Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about Hospitalization
概要
arXiv:2510.00436v2 Announce Type: replace Abstract: Automated approaches to answer patient-posed health questions are rising, but selecting among systems requires reliable evaluation. The current gold standard for evaluating the free-text artificial intelligence (AI) responses--human expert review-…