reference
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova introduced the BERT model for deep bidirectional transformer-based language understanding in a 2018 arXiv preprint.

Authors

Sources

Referenced by nodes (2)