reference
The research paper 'Decoupled weight decay regularization' was published in the Findings of the Association for Computational Linguistics ACL 2024, pp. 11065–11082, and cited in section 2.3.1 of the survey.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper