claim
The MedHallu benchmark exposes current limitations in Large Language Model hallucination detection.
