claim
BERT introduced bidirectional training, which allows the model to analyze context from both sides to improve language understanding.

Authors

Sources

Referenced by nodes (1)