measurement
The DeepSeek-R1 model experienced a performance decrease when using search augmentation, dropping from an 86.6% baseline to 84.3% (a -2.3% change).

Authors

Sources

Referenced by nodes (1)