measurement
Medical-specific models, including pmc-llama, medalpaca, and alpacare, consistently exhibit lower semantic similarity scores ranging from 0.1 to 0.4 alongside higher hallucination rates.

Authors

Sources

Referenced by nodes (1)