Fact — reference — Knowledge Tree

The TruthfulQA benchmark evaluates AI models using MC1, MC2, and MC3 multiple-choice scores, and for open-ended generation, it uses %Truth, %Info, %Truth*Info, and %Reject metrics.

Authors

Person: Not available Organization: GitHub
EdinburghNLP/awesome-hallucination-detection - GitHub

Sources

EdinburghNLP/awesome-hallucination-detection - GitHub github.com GitHub via serper

Referenced by nodes (2)

TruthfulQA concept
%Truth concept