claim
The non-universal scaling exponents in large language models are linked to the intrinsic dimension of the data manifold.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept