Replacing GPTQ's per-column scalar quantizer with turbo as the inner block quantizer composes well — GPTQ's Hessian-corrected weights pre-align for turbo's rounding, FWHT Gaussianization makes the Lloyd-Max grid usable on weights it normally clips
GPTQ + turbo at 4-bit is much better than either alone (gptq_q4=22.60, turbo4=24.14). Still ~1.6 PPL above Q4_K_M but at 0.7 fewer bits.
act_order is essentially neutral when the inner quantizer is turbo — the per-tile FWHT already absorbs column-ordering effects. Default off for this pipeline.