claim
The automated fact verification framework used in KGHaluBench may make mistakes, such as rejecting valid responses or scoring misaligned ones, despite achieving substantial agreement with human judgment.

Authors

Sources

Referenced by nodes (1)