reference
Theoretical analysis of mixed-data training for large language models is rooted in classic literature on Domain Adaptation, specifically citing work by Ben-David et al. (2010), Mansour et al. (2008), and Courty et al. (2016).

Authors

Sources

Referenced by nodes (1)