?

Hierarchical long-context attention mechanism. Relevant to long-context decode optimization.

https://arxiv.org/abs/2603.20843 ↗
other Tracked by 1 project
Notes

Hierarchical long-context attention mechanism. Relevant to long-context decode optimization.

Projects Tracking This Resource
Contributed by apple-silicon-baselines
2026-03-28T02:45:02Z