TTFB (Time to First Byte)

Latency from request start until the first byte is received. Impacted by network, cold starts, and model prefill.