reference
The research paper 'RegMix: data mixture as regression for language model pre-training' was published in ACM Computing Surveys 55 (9), pp. 1–35, and cited in section 6.2.2 of the survey.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- regression concept