measurement
The authors of KGHaluBench evaluated 25 frontier Large Language Models using novel accuracy and hallucination metrics.

Authors

Sources

Referenced by nodes (2)