procedure
The MedHallu benchmark generates hallucinated answers through a controlled pipeline to create a dataset for binary hallucination detection.
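
To make the binary detection task concrete, here is a minimal sketch of how predictions on such a dataset could be scored. It is not the benchmark's own evaluation code: the `DetectionExample` record, its field names, and the `detection_metrics` helper are hypothetical, and the placeholder strings stand in for real questions and answers. The sketch assumes each item pairs a question with one answer and a gold label saying whether that answer is hallucinated, and it treats "hallucinated" as the positive class.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DetectionExample:
    """Hypothetical record layout; the field names are assumptions."""
    question: str
    answer: str
    is_hallucinated: bool  # gold label: True if the answer is hallucinated


def detection_metrics(gold: List[bool], pred: List[bool]) -> Dict[str, float]:
    """Score binary predictions, with 'hallucinated' as the positive class."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp = sum(1 for g, p in zip(gold, pred) if g and p)
    fp = sum(1 for g, p in zip(gold, pred) if not g and p)
    fn = sum(1 for g, p in zip(gold, pred) if g and not p)
    tn = sum(1 for g, p in zip(gold, pred) if not g and not p)

    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Placeholder items only; no real benchmark data is reproduced here.
examples = [
    DetectionExample("question 1", "answer 1", True),
    DetectionExample("question 2", "answer 2", True),
    DetectionExample("question 3", "answer 3", False),
    DetectionExample("question 4", "answer 4", False),
]
gold_labels = [ex.is_hallucinated for ex in examples]
predicted = [True, False, False, True]  # output of some hypothetical detector
metrics = detection_metrics(gold_labels, predicted)
```

Reporting precision and recall alongside accuracy shows whether a detector errs by missing hallucinated answers or by flagging faithful ones, which accuracy alone would hide.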
