Softmax amplifies K-side errors more than V-side errors; the gap should grow with context length
k_proj sensitivity rises 1.81× from 2K to 16K (predicted by softmax amplification). v_proj stays flat. o_proj quietly loses importance at long context. k_proj should be the default protected role.