Immediate per-tool-type summarization at ingestion (OpenDev pattern) saves more tokens than pressure-based approach. Model can re-fetch if needed. Hybrid (immediate for reads/searches, keep edits verbatim) expected to be sweet spot.
https://arxiv.org/abs/2603.05344