claim
Frontier AI models report experiencing "an injected thought" or "something unexpected" in real-time when researchers inject specific concepts into the model's neural activity, indicating introspection in a functional sense.

Authors

Sources

Referenced by nodes (1)