GGUF

A model file format optimized for efficient CPU/GPU inference in the llama.cpp ecosystem.

Related terms