reference
The paper 'Demystify Mamba in Vision: A Linear Attention Perspective' (arXiv:2405.16605) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' in connection with linear attention.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)