Claim
Larger language models trained on more data tend to hallucinate less on high-frequency facts: well-attested entities appear often enough in the training data to give the model a stronger signal, which lessens the effect of training-data issues and knowledge gaps.

Authors

Sources

Referenced by nodes (1)