Fact — measurement — Knowledge Tree

Seddik et al. (2024) concluded that to maintain model stability, the amount of synthetic data used in training must be considerably smaller than the amount of real data in the training mix.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (1)

Synthetic data concept