reference
The paper 'Resurrecting recurrent neural networks for long sequences' was published in the International Conference on Machine Learning, pp. 26670–26698.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper