claim
The authors employ the Area Under the Receiver Operating Characteristic curve (AUROC) and the Area Under the Precision-Recall curve (PR-AUC) as primary evaluation metrics for hallucination detection, as both provide threshold-independent evaluations of ranking performance.
Authors
Sources
- Re-evaluating Hallucination Detection in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- hallucination detection concept