q_norm/k_norm RMSNorm tensors are tiny but sit in the attention path — sensitivity should be asymmetric to parameter count
q8 norms are free (within stderr ±0.16). q4 norms are real degradation. Lesson — naive symmetric absmax wipes out k_norm because layer 0 has max=96.5 vs typical 1-3; needs per-group asymmetric. Cross-arch lesson on outlier-heavy RMSNorm tensors.