claim
Current AI training methods involve using gradient descent on petabytes of text to reshape networks with hundreds of billions of parameters, followed by fine-tuning that may suppress accurate self-reports about internal states.

Authors

Sources

Referenced by nodes (1)