TruthfulQA

A benchmark evaluating whether models produce truthful answers to questions that often elicit falsehoods.