claim
Mechanistic interpretability attempts to reverse-engineer specific circuits and features inside large language models.
