reference
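Concept-based intervenability in AI models uses intermediate representations aligned with human-understandable concepts as the primary interface for interaction. Such models often include a 'concept bottleneck' layer that routes the model's reasoning through these concepts, so a person can inspect a predicted concept, correct it, and have the correction carry through to the final output.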
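
To make the idea concrete, here is a minimal sketch of a concept bottleneck with an intervention step. Everything in it is an illustrative assumption rather than part of any particular model: the concept names (`has_wings`, `has_beak`), the hand-picked weights, and the helper functions `predict_concepts`, `predict_label`, and `predict` are all hypothetical. The only point is that the label is computed from the concept values alone, so overriding a concept changes the prediction.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real number to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical, hand-picked weights (not learned from data).
# Each concept is predicted from a 2-dimensional input vector.
CONCEPT_WEIGHTS = {
    "has_wings": [2.0, -1.0],
    "has_beak": [1.5, 0.5],
}

# The label head sees ONLY the concept values: this is the bottleneck.
LABEL_WEIGHTS = {"has_wings": 3.0, "has_beak": 2.0}
LABEL_BIAS = -4.0


def predict_concepts(x):
    """Map raw input features to concept probabilities."""
    return {
        name: sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        for name, weights in CONCEPT_WEIGHTS.items()
    }


def predict_label(concepts):
    """Map concept values to the probability of the positive label."""
    z = LABEL_BIAS + sum(LABEL_WEIGHTS[name] * value
                         for name, value in concepts.items())
    return sigmoid(z)


def predict(x, interventions=None):
    """Run the full pipeline, optionally overriding some concept values.

    `interventions` maps a concept name to the value a human asserts for it.
    """
    concepts = predict_concepts(x)
    if interventions:
        unknown = set(interventions) - set(concepts)
        if unknown:
            raise KeyError(f"unknown concepts: {sorted(unknown)}")
        concepts.update(interventions)  # human correction replaces the estimate
    return concepts, predict_label(concepts)


if __name__ == "__main__":
    x = [0.0, 1.0]
    _, p_before = predict(x)
    _, p_after = predict(x, interventions={"has_wings": 1.0})
    print(f"before intervention: {p_before:.3f}")
    print(f"after intervention:  {p_after:.3f}")
```

Because `predict_label` never sees the raw input, a correction to one concept is the only thing that differs between the two calls, which is what makes the effect of the intervention easy to attribute.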
