reference
Marconato et al. (2024) established an 'all-or-none' identifiability theorem, which proves that linear properties in Large Language Models either hold in all or in none of the distributionally equivalent models under specific conditions.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept