measurement
Evaluation methods for hallucination detection utilize AUROC as a metric across datasets including PAWS, XSum, QAGS, FRANK, SummEval, BEGIN, Q^2, DialFact, FEVER, and VitaminC.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper