Different reasoning lengths need different compression levels. S1 (4-6 sentences, ≥800ch), S2 (2-3 sentences, ≥400ch), S3 (1 sentence, <400ch) — all generated in one LLM call, code picks appropriate tier based on original reasoning length
117s in 7 rounds. Tiered CR working — 4/6 rounds saving context (12-54%). Short blocks get S3, medium get S2. Two marginal expansions (-0.9%, -8.7%).