reference
The paper 'How do transformers learn topic structure: towards a mechanistic understanding' was published in the proceedings of the International Conference on Machine Learning, pp. 19689–19729.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (2)
- Transformers (concept)
- International Conference on Machine Learning (entity)