reference
The paper 'Selective induction heads: how transformers select causal structures in context' was presented at The Thirteenth International Conference on Learning Representations.

Authors

Sources

Referenced by nodes (4)