FWHT rotation is essential for turbo3 quality
ROTATION IS ESSENTIAL. Rotation provides a 0.39 PPL gain (6.24→5.83); norm correction adds 0.29 more (6.52→6.24). Together they make turbo3 beat q8_0. Implementation: forward rotation = sign1 multiply → in-place FWHT → sign2 multiply. Inverse = sign2 multiply → FWHT → sign1 multiply (FWHT is self-inverse, given 1/√n normalization). The sign arrays are static constants (128 values each, from turbo-wht.h). WARNING: a previous session incorrectly concluded that rotation hurts. That result came from a broken double-rotation setup (inline FA and graph-level rotation both active). DO NOT DISABLE ROTATION.
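The forward/inverse sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the turbo3 implementation. The function names are hypothetical. The sign arrays are random ±1 placeholders, not the constants from turbo-wht.h. The 1/√n scaling is an assumption, chosen so that the transform is exactly self-inverse.

```python
import math
import random


def fwht_inplace(x):
    """In-place fast Walsh-Hadamard transform with 1/sqrt(n) scaling.

    With this scaling the transform is orthonormal and its own inverse.
    The length of x must be a power of two.
    """
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        # Butterfly pass: combine elements that are h apart.
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = x[j], x[j + h]
                x[j], x[j + h] = a + b, a - b
        h *= 2
    scale = 1.0 / math.sqrt(n)
    for i in range(n):
        x[i] *= scale


def rotate_forward(x, sign1, sign2):
    """sign1 multiply -> FWHT -> sign2 multiply."""
    y = [v * s for v, s in zip(x, sign1)]
    fwht_inplace(y)
    return [v * s for v, s in zip(y, sign2)]


def rotate_inverse(y, sign1, sign2):
    """sign2 multiply -> FWHT -> sign1 multiply (undoes rotate_forward)."""
    x = [v * s for v, s in zip(y, sign2)]
    fwht_inplace(x)
    return [v * s for v, s in zip(x, sign1)]


def make_placeholder_signs(n, seed):
    """Hypothetical stand-in for the static sign tables: random +/-1 values."""
    rng = random.Random(seed)
    return [rng.choice((-1.0, 1.0)) for _ in range(n)]


if __name__ == "__main__":
    sign1 = make_placeholder_signs(128, seed=1)
    sign2 = make_placeholder_signs(128, seed=2)
    rng = random.Random(0)
    x = [rng.gauss(0.0, 1.0) for _ in range(128)]
    back = rotate_inverse(rotate_forward(x, sign1, sign2), sign1, sign2)
    print("max round-trip error:", max(abs(a - b) for a, b in zip(x, back)))
```

In this sketch, applying rotate_forward twice does not return the input in general; only the inverse with the reversed sign order does. That is consistent with the double-rotation failure noted above.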
Mode 3 (last4) and Mode 5 (first2+last2) give the same PPL: 5.8091. The last 2 layers are the critical ones; protecting them dominates. The first 4 layers contribute less than the last 4. Mode 5 is the max-compression sweet spot: only 4 layers at q8_0, ~4.2x compression, -0.49% PPL. Context recommendations: up to 65K, use LA-1 (best PPL); 65K-128K, use LA-5 (LA-1 OOMs); 128K+, use uniform turbo3.
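The layer selections and context thresholds above can be written down as a small helper. This is a sketch under stated assumptions: the function names are hypothetical, 65K and 128K are read as 65,536 and 131,072 tokens, and the handling of the exact boundary values is a guess. Only modes 3 and 5 are covered, since those are the two whose layer sets are spelled out here.

```python
# Assumed token counts for the "65K" and "128K" thresholds in the notes.
CTX_65K = 65536
CTX_128K = 131072


def q8_layers(mode, n_layers):
    """Return the layer indices kept at q8_0 for a given layer-adaptive mode.

    Only the two modes described in the notes are handled:
      mode 3 = last 4 layers, mode 5 = first 2 + last 2 layers.
    """
    if n_layers < 4:
        raise ValueError("need at least 4 layers")
    if mode == 3:
        return list(range(n_layers - 4, n_layers))
    if mode == 5:
        return [0, 1, n_layers - 2, n_layers - 1]
    raise ValueError("mode not covered by this sketch")


def recommend_kv_config(n_ctx):
    """Map a context length to the recommendation from the notes.

    Boundary handling (<= 65K, < 128K) is an assumption.
    """
    if n_ctx <= CTX_65K:
        return "LA-1"            # best PPL
    if n_ctx < CTX_128K:
        return "LA-5"            # LA-1 runs out of memory in this range
    return "uniform turbo3"


if __name__ == "__main__":
    print(q8_layers(5, 32))
    print(recommend_kv_config(100000))
```

Both modes protect exactly 4 layers in this sketch, so they have the same q8_0 budget; they differ only in which layers are protected.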
Asymmetric layer-adaptive promotion does NOT help. Promoting only K or only V gives identical PPL (5.8390), and both are worse than uniform turbo3. The norm correction mismatch between turbo and q8_0 within the same layer hurts quality. Whether K or V is the one promoted makes no difference. Both must be promoted together (mode 2 at 5.8140) for the improvement to work.