MLX TurboQuant preserves needle-in-haystack retrieval
https://github.com/Prince-Canuma
Same-day implementation (Mar 24). Scores 6/6 on needle-in-a-haystack (NIAH) retrieval at both 2.5-bit and 3.5-bit, with zero accuracy loss confirmed. The Metal kernel is still a work in progress.
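For readers unfamiliar with how a score like 6/6 is tallied, here is a minimal sketch of a needle-in-a-haystack check. It is not the harness behind the result above: `build_haystack`, `niah_score`, and `toy_generate` are hypothetical names, the six insertion depths are an illustrative assumption, and `toy_generate` is a stand-in for a real model call (for example, one running with a quantized KV cache).

```python
import random


def build_haystack(needle: str, n_filler: int, depth: float, seed: int = 0) -> str:
    """Insert `needle` at relative `depth` (0.0 = start, 1.0 = end) among filler sentences."""
    rng = random.Random(seed)
    filler = [f"Filler sentence number {rng.randint(0, 9999)}." for _ in range(n_filler)]
    pos = int(depth * n_filler)
    return " ".join(filler[:pos] + [needle] + filler[pos:])


def niah_score(generate, secret: str, depths, n_filler: int = 200) -> tuple[int, int]:
    """Return (hits, total): how many depths yield an answer containing `secret`."""
    needle = f"The secret code is {secret}."
    question = "\n\nWhat is the secret code?"
    hits = 0
    for depth in depths:
        answer = generate(build_haystack(needle, n_filler, depth) + question)
        hits += secret in answer  # a bool counts as 0 or 1
    return hits, len(depths)


def toy_generate(prompt: str) -> str:
    """Stand-in for a model (assumption): reads the code straight out of the prompt."""
    marker = "The secret code is "
    i = prompt.find(marker)
    return prompt[i + len(marker):].split(".")[0] if i >= 0 else ""


# Six illustrative depths; not the configuration used for the reported result.
hits, total = niah_score(toy_generate, "7481", [0.0, 0.2, 0.4, 0.6, 0.8, 1.0])
print(f"{hits}/{total}")
```

To compare quantization settings, one would swap `toy_generate` for calls into the same model at each bit width and check that the hit count does not drop.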