claim
Large language models improve not only by scaling training data and model size, but also by spending more computation at test time, for example by letting the model reason recurrently or iteratively.
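As a rough illustration of the test-time half of this claim, the toy sketch below holds a "model" fixed and varies only how many refinement steps it may take at inference. It is an analogy, not an LLM: the step function is a Newton update for a square root, and the names `refine`, `answer`, and `errors` are hypothetical, introduced here only for illustration. The point is that the same fixed update rule, applied more times, can give a more accurate answer without any change to its parameters.

```python
def refine(x: float, target: float) -> float:
    """One fixed refinement step (assumed here: a Newton update toward sqrt(target))."""
    return 0.5 * (x + target / x)


def answer(target: float, steps: int, x0: float = 1.0) -> float:
    """Apply the same step `steps` times; `steps` stands in for the test-time compute budget."""
    x = x0
    for _ in range(steps):
        x = refine(x, target)
    return x


def errors(target: float, budgets: list[int]) -> list[float]:
    """Absolute error of the answer at each compute budget."""
    truth = target ** 0.5
    return [abs(answer(target, s) - truth) for s in budgets]


if __name__ == "__main__":
    # Print the error for increasing step budgets; the step rule itself never changes.
    for steps, err in zip(range(5), errors(2.0, list(range(5)))):
        print(f"steps={steps}  error={err:.3e}")
```

In this toy setting the error shrinks as the step budget grows. Whether and how quickly real language models benefit from extra iterations depends on the model and task, which this sketch does not attempt to capture.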
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (2)
- Large Language Models concept
- iterative reasoning concept