reference
The paper 'Transformers are rnns: fast autoregressive transformers with linear attention' was published in the International Conference on Machine Learning, pages 5156–5165, and is cited in section 3.2.3 of 'A Survey on the Theory and Mechanism of Large Language Models'.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- Transformers concept
- International Conference on Machine Learning entity