claim
KGHaluBench statistically estimates the difficulty of each question, aggregates these estimates across the assessment, and scales the accuracy accordingly so that the evaluation is reliable.
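
The claim does not say how difficulty is estimated or how accuracy is scaled, so the following is only an illustrative sketch of the general idea, not KGHaluBench's actual procedure. It assumes, hypothetically, that a question's difficulty is the fraction of reference attempts that failed on it, and that the scaled accuracy is a difficulty-weighted accuracy. The names `estimate_difficulty` and `difficulty_scaled_accuracy` are invented for this example.

```python
from statistics import mean


def estimate_difficulty(reference_outcomes):
    """Hypothetical difficulty estimate for one question.

    Assumption: difficulty is the share of reference attempts that
    answered incorrectly (0.0 = everyone succeeded, 1.0 = everyone failed).
    """
    return 1.0 - mean(1.0 if ok else 0.0 for ok in reference_outcomes)


def difficulty_scaled_accuracy(correct, difficulties):
    """Aggregate per-question difficulty and weight accuracy by it.

    Assumption: each correct answer earns credit equal to the question's
    difficulty, normalised by the total difficulty of the assessment.
    """
    total = sum(difficulties)
    if total == 0:
        # No question carries any weight: fall back to plain accuracy.
        return mean(1.0 if c else 0.0 for c in correct)
    earned = sum(d for c, d in zip(correct, difficulties) if c)
    return earned / total


# Two questions: one easy (3 of 4 reference attempts succeed), one hard (1 of 4).
difficulties = [
    estimate_difficulty([True, True, True, False]),
    estimate_difficulty([False, False, False, True]),
]

# Both models have a raw accuracy of 0.5, but they solved different questions.
solved_easy_only = difficulty_scaled_accuracy([True, False], difficulties)
solved_hard_only = difficulty_scaled_accuracy([False, True], difficulties)
```

Under these assumptions, a model that answers only the hard question scores higher than one that answers only the easy question, even though their unscaled accuracies are identical.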
