measurement
The LARS uncertainty estimation technique is evaluated using Accuracy, Precision, Recall, and AUROC metrics on the TriviaQA, GSM8k, SVAMP, and Common-sense QA datasets.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper