Fact — measurement — Knowledge Tree

On the MEDQA benchmark, removing search functionality from the AMG-RAG system drops accuracy to 67.16%, and removing Chain-of-Thought (CoT) reasoning drops it to 66.69%.
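
As a rough illustration of how the two ablation figures compare, the sketch below stores them in a dictionary and computes the gap between them. The dictionary keys and function names are hypothetical, chosen only for this example. Only the two accuracies stated in the fact are used. The full-system accuracy is not given here, so no drop relative to it is computed.

```python
# Ablation accuracies (percent) on MEDQA, as stated in the fact above.
# The dictionary keys and helper names are illustrative, not from the paper.
ABLATION_ACCURACY = {
    "without_search": 67.16,
    "without_cot": 66.69,
}


def accuracy_gap(results: dict[str, float], a: str, b: str) -> float:
    """Return accuracy of configuration a minus configuration b, in percentage points."""
    return round(results[a] - results[b], 2)


def lowest_accuracy(results: dict[str, float]) -> str:
    """Return the name of the configuration with the lowest accuracy."""
    return min(results, key=results.get)


gap = accuracy_gap(ABLATION_ACCURACY, "without_search", "without_cot")
worst = lowest_accuracy(ABLATION_ACCURACY)
print(f"Gap: {gap} percentage points; lowest accuracy: {worst}")
```

On these two figures alone, the configuration without CoT scores 0.47 percentage points lower than the configuration without search.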

Authors

Person: Not available
Organization: arXiv
Bridging the Gap Between LLMs and Evolving Medical Knowledge

Sources

Bridging the Gap Between LLMs and Evolving Medical Knowledge (arXiv, arxiv.org)

Referenced by nodes (3)

chain-of-thought concept
AMG-RAG concept
MEDQA concept