claim
Scaling up large language model size and training data simultaneously tends to reduce hallucinations regarding well-documented facts because larger models have greater capacity to memorize and recall high-frequency information.

Authors

Sources

Referenced by nodes (2)