claim
In large language models, large initial errors cause the model to reach a maximum divergence ceiling quickly, resulting in a complete loss of calibration relative to the correct distribution.

Authors

Sources

Referenced by nodes (1)