VITA-QinYu: Expressive Spoken Language Model for Role-Playing and Singing
概要
arXiv:2605.06765v1 Announce Type: cross Abstract: Human speech conveys expressiveness beyond linguistic content, including personality, mood, or performance elements, such as a comforting tone or humming a song, which we formalize as role-playing and singing. We present VITA-QinYu, the first expres…