reference
The paper 'Revisiting transformers through the lens of low entropy and dynamic sparsity' is an arXiv preprint (arXiv:2504.18929) cited in section 3.2.3 of 'A Survey on the Theory and Mechanism of Large Language Models'.

