claim
Distribution-Based Sensitivity Analysis (DBSA) enables users to perform quick, plug-and-play visual exploration of how language models rely on specific input tokens, potentially identifying sensitivities overlooked by existing interpretability methods.
Authors
Sources
- Track: Poster Session 3 - aistats 2026 virtual.aistats.org via serper
Referenced by nodes (1)
- Language Model concept