claim
Interpretability methods for Large Language Models are categorized into three broad groups: global, local, and mechanistic interpretability.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept