Fact — reference — Knowledge Tree

The paper 'GaLore: memory-efficient LLM training by gradient low-rank projection' is published in the International Conference on Machine Learning, pp. 61121–61143, and is cited in sections 1 and 7.2.2 of 'A Survey on the Theory and Mechanism of Large Language Models'.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (1)

International Conference on Machine Learning entity