reference
Sundararajan et al. (2017) proposed Integrated Gradients, an axiomatic attribution method that assigns a contribution score to each input feature for token-level importance analysis in neural NLP and LLM outputs.

Authors

Sources

Referenced by nodes (1)