Reference
The paper 'Theoretical insights into fine-tuning attention mechanism: generalization and optimization' was published in the Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI-25), edited by J. Kwok, pages 6830–6838.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)