claim
Transformers have a quadratic computational cost, which acts as an obstacle to their broad deployment in real-world settings, according to Vaswani et al. (2017a).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Transformers concept