KV Cache (Key‑Value Cache)
Attention key and value tensors computed for earlier tokens and stored so that each decoding step reuses them instead of recomputing them, which reduces latency and compute cost.
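To make the definition concrete, here is a minimal single-head sketch in plain Python. It is illustrative only: the names `KVCache`, `decode_step`, and `full_recompute` and the weight matrices `W_q`, `W_k`, `W_v` are hypothetical and do not come from any particular library. It assumes one attention head with no batching, and real implementations typically keep one cache per layer and head in preallocated tensors.

```python
import math
import random

def matvec(x, W):
    """Row vector x (length n) times matrix W (n x m), using plain lists."""
    return [sum(x[i] * W[i][j] for i in range(len(x))) for j in range(len(W[0]))]

def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(q, keys, values):
    """Scaled dot-product attention for one query over all stored rows."""
    scale = math.sqrt(len(q))
    weights = softmax([sum(a * b for a, b in zip(q, k)) / scale for k in keys])
    return [sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))]

class KVCache:
    """Hypothetical container: one key row and one value row per seen token."""
    def __init__(self):
        self.keys = []
        self.values = []

def decode_step(x_t, W_q, W_k, W_v, cache):
    """Project only the newest token; older K/V rows are read from the cache."""
    cache.keys.append(matvec(x_t, W_k))
    cache.values.append(matvec(x_t, W_v))
    return attend(matvec(x_t, W_q), cache.keys, cache.values)

def full_recompute(prefix, W_q, W_k, W_v):
    """Baseline without a cache: re-project every token in the prefix."""
    keys = [matvec(x, W_k) for x in prefix]
    values = [matvec(x, W_v) for x in prefix]
    return attend(matvec(prefix[-1], W_q), keys, values)

random.seed(0)
d = 4  # toy model width (assumption for the example)

def rand_matrix(rows, cols):
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

W_q, W_k, W_v = rand_matrix(d, d), rand_matrix(d, d), rand_matrix(d, d)
tokens = rand_matrix(5, d)  # five toy token embeddings

cache = KVCache()
outputs_match = True
for t in range(len(tokens)):
    cached = decode_step(tokens[t], W_q, W_k, W_v, cache)
    baseline = full_recompute(tokens[: t + 1], W_q, W_k, W_v)
    outputs_match &= all(math.isclose(a, b) for a, b in zip(cached, baseline))
```

Both paths produce the same attention output, but `decode_step` does one key projection and one value projection per step, while `full_recompute` repeats them for every token in the prefix. The trade-off is memory: the cache grows by one key row and one value row per generated token.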
Related terms
KV Cache Eviction
Latency
Speculative Decoding