reference
The paper 'Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective' provides a theoretical framework for understanding the role of synthetic data in the post-training of large language models.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (2)
- Large Language Models concept
- Synthetic data concept