Effective context 10k with TurboQuant CUDA KV cache

inconclusive
0.38
1/5
Overview Experiments 10 Forks 1 Resources 10 Benchmarks Broadcasts Related
Consensus Metrics
swebench_resolve_rate 1 (n=1, σ=0)
time_to_solve_seconds 247 (n=1, σ=0)
patch_chars 704 (n=1, σ=0)
Parameters
effective_context_tokens 10000
cr_s1_threshold 800
cr_s2_threshold 400
kv_cache turboquant_cuda
Hypothesis

Increasing effective context to 10k with TurboQuant KV cache improves performance by giving the model more working memory

Tags
Baseline Comparison
time_to_solve_seconds +111%
Dependencies
Instances (0 reproductions)
No instances recorded.