claim
GPT-4 significantly outperformed LLaMA 2 and DeepSeek in hallucination robustness, while DeepSeek provided moderate improvements over LLaMA 2, particularly in extrinsic hallucinations.
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (1)
- extrinsic hallucination concept