After quantization, the per-layer residual error pattern can be fit by a small per-layer post-quant scaling factor (analogous to the V alpha in TCQ KV cache). One scalar per layer, fit to minimize per-layer reconstruction loss
KV cache TCQ V alpha