claim
Transformers can learn different attention patterns that all generate the same bounded outputs, so interpretability via local ("myopic") analysis of attention can be provably misleading, according to Wen et al. (2023).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org, via serper)
Referenced by nodes (2)
- Transformers concept
- interpretability concept
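The claim above can be illustrated with a toy sketch (my own illustration, not the construction from Wen et al. 2023): when the value vectors at attended positions coincide, every attention distribution yields the same attention output, so inspecting the attention pattern alone tells you nothing about the computation.

```python
import numpy as np

# Hypothetical setup: 5 positions whose value vectors are all identical.
rng = np.random.default_rng(0)
v = rng.standard_normal(8)      # one shared value vector
V = np.tile(v, (5, 1))          # value matrix with identical rows

def attend(scores, V):
    """Softmax over positions, then weighted sum of value vectors."""
    w = np.exp(scores - scores.max())
    w = w / w.sum()
    return w @ V

# Two very different attention patterns...
out_uniform = attend(np.zeros(5), V)                     # uniform attention
out_peaked = attend(np.array([10.0, 0, 0, 0, 0]), V)     # near one-hot attention

# ...produce the same output, because any convex combination of
# identical rows equals that row.
assert np.allclose(out_uniform, out_peaked)
```

This is only a degenerate special case; the paper's point is stronger, showing that distinct, non-degenerate attention patterns can implement the same correct solution, which undermines myopic attention-based interpretability.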