measurement
The DeepSeek-R1 model experienced a performance decrease when using search augmentation, dropping from an 86.6% baseline to 84.3% (a -2.3% change).
Authors
Sources
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (1)
- DeepSeek-R1 concept