Best AI Tools
Tools
Top 100
AI News
Learn
Compare
Partner
Submit Tool
AI Glossary
/
Context Caching
Context Caching
A technique that reuses previously computed attention/key‑value states for repeated prefixes, reducing latency and cost in long or iterative prompts.
Related terms
Latency
Throughput
Context Window
View on glossary index