reference
DeepSeek-AI published the DeepSeek-R1 technical report in 2025, detailing the use of reinforcement learning to incentivize reasoning capabilities in large language models.
Sources
- A Comprehensive Benchmark and Evaluation Framework for Multi ... (arxiv.org)
Referenced by nodes (5)
- Large Language Models concept
- reinforcement learning concept
- DeepSeek-R1 concept
- DeepSeek-AI entity
- reasoning capabilities concept