claim
The INSIDE framework and EigenScore metric were evaluated on LLaMA and OPT models across question answering benchmarks, improving detection compared with uncertainty- and lexical-similarity baselines.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (1)
- Eigenscore concept