Inference Cost
The computational and financial cost of running a trained model to generate predictions or outputs. Factors include model size, token count, hardware requirements, and API pricing. Optimizing inference cost is crucial for production AI applications.