Fact — claim — Knowledge Tree

According to experiments conducted for the MedHallu benchmark, general-purpose large language models often outperform specialized medical models on hallucination detection tasks.

Authors

Person: Not available
Organization: The Moonlight
[Literature Review] MedHallu: A Comprehensive Benchmark for ...

Sources

[Literature Review] MedHallu: A Comprehensive Benchmark for ...
www.themoonlight.io (The Moonlight)
