VLM

Alias for Vision‑Language Model (VLM), a multimodal model for image/video + text.