claim
Research presented at EMNLP 2025 found that alignment-tuned Large Language Models produce more faithful explanations than base models, and that faithfulness and plausibility are positively correlated.

Authors

Sources

Referenced by nodes (2)