measurement
Evaluation of uncertainty and confidence in language models uses AUROC, AUARC, NumSet, Deg, and EigV as metrics, and utilizes datasets including CoQA, TriviaQA, and Natural Questions.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (5)
- TriviaQA concept
- uncertainty concept
- AUROC concept
- confidence concept
- NaturalQuestions concept