reference
The paper 'Understanding scaling laws with statistical and approximation theory for transformer neural networks on intrinsically low-dimensional data' (Advances in Neural Information Processing Systems 37) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding scaling laws.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper