measurement
The authors of KGHaluBench evaluated 25 frontier models using novel accuracy and hallucination metrics to gain insight into the knowledge factors causing hallucinations across different model sizes.

Authors

Sources

Referenced by nodes (1)