reference
Benchmarks such as TruthfulQA (Lin et al., 2022), HallucinationEval (Wu et al., 2023), and RealToxicityPrompts (Gehman et al., 2020) were introduced to assess hallucination bias across models and tasks.
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (1)
- TruthfulQA concept