arXiv cs.AI by Synapse Flow 編集部

Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation

概要

arXiv:2605.06207v1 Announce Type: cross Abstract: Most discrete visual tokenizers rely on a default design: every position in the sequence shares the same codebook. Researchers try to scale the codebook size $K$ to get better reconstruction performance. Such a constant-codebook design hits a fundam…

元記事を読む →

関連記事