claim
GPT-4's lower hallucination rate is more stable across prompts (lower Prompt Sensitivity) compared to GPT-3.5, as observed in the analysis and supported by prior findings from Liu et al. (2023).

Authors

Sources

Referenced by nodes (1)