claim
GPT-4 significantly outperformed LLaMA 2 and DeepSeek in hallucination robustness, while DeepSeek provided moderate improvements over LLaMA 2, particularly in extrinsic hallucinations.

Authors

Sources

Referenced by nodes (1)