measurement
Adding a 'not sure' response option to Large Language Models improves hallucination detection precision by up to 38% in the MedHallu benchmark.

Authors

Sources

Referenced by nodes (3)