claim
A central debate in the theoretical community concerns whether Reinforcement Learning (RL) truly instills new reasoning capabilities in Large Language Models or merely elicits latent abilities acquired during pre-training.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- Large Language Models concept
- reinforcement learning concept