reference
Li et al. (2025b) theoretically analyzed 'Task Arithmetic' and proved that, under suitable assumptions, linear operations such as addition and negation can successfully edit knowledge in nonlinear Transformers and generalize to out-of-domain tasks.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Transformers concept