Quantizing With Randomized Hadamard Transforms: Efficient Heuristic Now Proven
概要
arXiv:2605.06014v1 Announce Type: cross Abstract: Uniform random rotations (URRs) are a common preprocessing step in modern quantization approaches used for gradient compression, inference acceleration, KV-cache compression, model weight quantization, and approximate nearest-neighbor search in vect…