claim
Independent researcher Christopher Ackerman found evidence of limited but real introspective abilities in AI models. Rather than relying on self-reports, his tests checked whether models can access and use internal confidence signals. He notes that these abilities grow stronger in more capable models.
