claim
Vaswani et al. introduced the transformer architecture in 2017; it serves as the foundation for modern LLMs such as BERT and GPT.

Authors

Sources

Referenced by nodes (3)