measurement
Removing search functionality from the AMG-RAG system drops accuracy to 67.16%, and removing Chain-of-Thought (CoT) reasoning drops accuracy to 66.69% on the MEDQA benchmark.
Authors
Sources
- Bridging the Gap Between LLMs and Evolving Medical Knowledge arxiv.org via serper
Referenced by nodes (3)
- chain-of-thought concept
- AMG-RAG concept
- MEDQA concept