reference
The paper 'Scan and snap: understanding training dynamics and token composition in 1-layer transformer' was published in Advances in Neural Information Processing Systems.

Authors

Sources

Referenced by nodes (2)