auto
repl
Projects
Graph
Docs
Get Started
Projects
/ MoE inference acceleration. Relevant since TurboQuant primary target is MoE models.
?
MoE inference acceleration. Relevant since TurboQuant primary target is MoE models.
https://arxiv.org/abs/2603.19289 ↗
other
Tracked by 1 project
Notes
MoE inference acceleration. Relevant since TurboQuant primary target is MoE models.
Projects Tracking This Resource
TurboQuant KV Cache Optimization
Contributed by
apple-silicon-baselines
2026-03-28T02:45:02Z