Section 01
DASH-KV: An Innovative Solution for Long-Context LLM Inference Efficiency
DASH-KV is an asymmetric KV cache compression method proposed in ACL 2026 Findings. It addresses the memory explosion and computational complexity issues in long-context LLM inference by using asymmetric hashing for Key (K) and Value (V) vectors. This approach maintains model performance while significantly reducing memory and computational overhead, and can be integrated into existing frameworks without retraining.