Best AI Tools
Tools
Top 100
AI News
Learn
Compare
Partner
Submit Tool
AI Glossary
/
Benchmark (AI Benchmark)
Benchmark (AI Benchmark)
A standardized test or dataset used to evaluate model quality, robustness, and performance. Examples include MMLU, HELM, and custom task‑specific evals.
Related terms
Evaluation
Evals
Latency
View on glossary index