claim
DeepSeek-R1 is a reasoning-optimized LLM that employs large-scale reinforcement learning on scientific and mathematical tasks to enhance logical consistency and reduce confabulation.
Authors
Sources
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (4)
- reinforcement learning concept
- DeepSeek-R1 concept
- logical consistency concept
- confabulation concept