measurement
Hallucination is evaluated with two metrics: the percentage of wrong answers, and the cases in which the model itself knows its answer is wrong (Snowballed Hallucinations). The datasets used are Primality Testing, Senator Search, and Graph Connectivity.
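As a rough illustration of how the two metrics could be tallied, here is a minimal Python sketch. The `Record` type, its field names, and the `hallucination_metrics` function are hypothetical and not taken from the source. The sketch also assumes that the snowballed rate is reported as a share of the wrong answers, and that "the model knows it is wrong" is determined by a separate check whose result is already stored on each record; the source does not specify either choice.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Record:
    """One evaluated question (hypothetical schema, for illustration only)."""
    dataset: str              # e.g. "Primality Testing"
    answer_correct: bool      # was the model's original answer right?
    model_admits_error: bool  # did the model flag its own answer as wrong when checked?


def hallucination_metrics(records: List[Record]) -> Dict[str, float]:
    """Return the wrong-answer percentage and the snowballed percentage.

    Assumption: the snowballed percentage uses the number of wrong answers
    as its denominator, not the total number of questions.
    """
    total = len(records)
    if total == 0:
        return {"wrong_pct": 0.0, "snowballed_pct": 0.0}

    wrong = [r for r in records if not r.answer_correct]
    # A wrong answer counts as snowballed when the model recognizes the error.
    snowballed = [r for r in wrong if r.model_admits_error]

    wrong_pct = 100.0 * len(wrong) / total
    snowballed_pct = 100.0 * len(snowballed) / len(wrong) if wrong else 0.0
    return {"wrong_pct": wrong_pct, "snowballed_pct": snowballed_pct}
```

Calling `hallucination_metrics` on the records of a single dataset gives per-dataset figures; passing all records together gives an aggregate.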
Sources
- EdinburghNLP/awesome-hallucination-detection (GitHub, github.com)
Referenced by nodes (1)
- hallucination concept