claim
Olsson et al. (2022) identified induction heads as specific attention heads whose learned algorithm underlies a large fraction of in-context learning in Large Language Models.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- Large Language Models concept
- In-Context Learning concept