reference
Previous attempts to explain BlackBox language models have utilized surrogate models like LIME (Ribeiro, Singh, and Guestrin 2016), visualization methods, and adversarial perturbations to input data (Chapman-Rounds et al. 2021).
Authors
Sources
- Building Trustworthy NeuroSymbolic AI Systems - arXiv arxiv.org via serper
Referenced by nodes (1)
- adversarial attack concept