reference
The paper 'Data mixing laws: optimizing data mixtures by predicting language modeling performance' is an arXiv preprint with identifier arXiv:2403.16952.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- arXiv entity