reference
The paper 'DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning' was published as an arXiv preprint in 2025.

Authors

Sources
