reference
The paper 'Deepseek-r1: incentivizing reasoning capability in llms via reinforcement learning' (arXiv:2501.12948) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding reasoning capabilities.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... arxiv.org via serper
Referenced by nodes (5)
- Large Language Models concept
- arXiv entity
- reinforcement learning concept
- DeepSeek-R1 concept
- reasoning capabilities concept