claim
A central debate in the theoretical community concerns whether Reinforcement Learning (RL) truly instills new reasoning capabilities in Large Language Models or merely elicits latent abilities acquired during pre-training.

Authors

Sources

Referenced by nodes (2)