claim
DeepSeek-R1 is a reasoning-optimized LLM that employs large-scale reinforcement learning on scientific and mathematical tasks to enhance logical consistency and reduce confabulation.

Authors

Sources

Referenced by nodes (4)