Toward Scalable Audio Description Quality Control: A Workflow for Evaluating Human and VLM Raters
概要
arXiv:2602.01390v2 Announce Type: replace-cross Abstract: Digital video is central to communication, education, and entertainment, but without audio description (AD), blind and low-vision users are excluded. While crowdsourced platforms and vision-language models (VLMs) expand AD production, qualit…