reference
The paper 'On zero-initialized attention: optimal prompt and gating factor estimation' was published in the Proceedings of the 42nd International Conference on Machine Learning, Proceedings of Machine Learning Research, Vol. 267, pp. 13713–13745.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- International Conference on Machine Learning event
- prompt concept