measurement
The aggregated hallucination rates for DeepSeek are 22.5% on TruthfulQA, 21.4% on HallucinationEval, and 20.1% on QAFactEval.
Sources
- Survey and analysis of hallucinations in large language models (www.frontiersin.org)
Referenced by nodes (1)
- TruthfulQA concept