Fact — claim — Knowledge Tree

Larger language models such as Med-Gemini and GPT-4 achieve the highest accuracy and F1 scores on the MEDQA benchmark, but they do so with significantly larger parameter counts.

Authors

Person: Not available
Organization: arXiv
Bridging the Gap Between LLMs and Evolving Medical Knowledge

Sources

Bridging the Gap Between LLMs and Evolving Medical Knowledge (arXiv, arxiv.org)

Referenced by nodes (2)

GPT-4 concept
MEDQA concept