KV Offloading

Moving key/value cache tensors to CPU or disk tiers to serve longer contexts under memory limits.