reference
The paper 'Looped transformers as programmable computers' was published in the International Conference on Machine Learning, pp. 11398–11442.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper