claim
Distribution-Based Sensitivity Analysis (DBSA) enables users to perform quick, plug-and-play visual exploration of how language models rely on specific input tokens, potentially identifying sensitivities overlooked by existing interpretability methods.

Authors

Sources

Referenced by nodes (1)