procedure
The study's authors apply derived weights to compute a difficulty score for each question. The scores are intended to make the assessment fairer and to offset the entity-popularity bias that the benchmark's question-generation mechanism introduces.
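The entry above does not give the weighting formula, so the following is only a minimal sketch of how weights could turn per-question features into a difficulty score, and how that score could then be used so that questions about popular entities do not dominate the result. The feature names (entity_rarity, relation_rarity), the weight values, and both function names are hypothetical and are not taken from the paper.

```python
from typing import Iterable, Mapping, Tuple


def difficulty_score(features: Mapping[str, float],
                     weights: Mapping[str, float]) -> float:
    """Weighted average of per-question features.

    Assumption: each feature is already scaled to [0, 1], with higher
    values meaning "harder" (e.g. a rarer, less popular entity).
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(weights[name] * features[name] for name in weights)
    return weighted_sum / total_weight


def difficulty_weighted_accuracy(
        results: Iterable[Tuple[bool, float]]) -> float:
    """Accuracy in which each question counts in proportion to its difficulty.

    `results` holds (is_correct, difficulty) pairs.
    """
    results = list(results)
    total_difficulty = sum(difficulty for _, difficulty in results)
    if total_difficulty == 0:
        return 0.0
    earned = sum(difficulty for is_correct, difficulty in results if is_correct)
    return earned / total_difficulty


# Hypothetical weights and features, for illustration only.
example_weights = {"entity_rarity": 0.75, "relation_rarity": 0.25}
example_question = {"entity_rarity": 0.8, "relation_rarity": 0.4}
example_score = difficulty_score(example_question, example_weights)
```

Under this sketch, a model that answers only easy (popular-entity) questions correctly earns less credit than its raw accuracy would suggest, which is one way a difficulty score can counter popularity bias.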
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... (arxiv.org)
Referenced by nodes (1)
- KGHaluBench concept