claim
Several research works relate the optimization objective of Transformers to energy-based principles, including Ramsauer et al. (2020), Hoover et al. (2023), Hu et al. (2023a), Wu et al. (2023), Ren et al. (2025), and Hu et al. (2025).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Transformers concept