reference
The paper 'Understanding chain-of-thought in LLMs through information theory' was published in the Proceedings of the 42nd International Conference on Machine Learning, Vol. 267, pp. 59784–59811, edited by A. Singh, M. Fazel, D. Hsu, S. Lacoste-Julien, F. Berkenkamp, T. Maharaj, K. Wagstaff, and J. Zhu.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (3)
- chain-of-thought concept
- International Conference on Machine Learning entity
- information theory concept