reference
The paper 'Inferring functionality of attention heads from their parameters' details methods for interpreting the internal mechanisms of attention heads in large language models.

Authors

Sources

Referenced by nodes (1)