reference
The paper 'On the generalization ability of unsupervised pretraining' was published in the proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 4519–4527.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (1)
- generalization concept