claim
Research by Honovich et al. (2022) and Kang et al. (2024) indicates that the ROUGE evaluation metric aligns poorly with human judgments of factual correctness in AI-generated text.
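The intuition behind this claim can be illustrated with a minimal sketch: ROUGE scores n-gram overlap, so a summary that negates the reference can still score highly. The function below is a hand-rolled ROUGE-1 F1 for illustration only (not the official ROUGE implementation), and the example sentences are invented, not drawn from the cited papers.

```python
from collections import Counter

def rouge1_f1(reference: str, candidate: str) -> float:
    """Unigram-overlap F1, a simplified stand-in for ROUGE-1."""
    ref = Counter(reference.lower().split())
    cand = Counter(candidate.lower().split())
    overlap = sum((ref & cand).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

reference = "the drug was approved by the fda in 2020"
faithful = "the fda approved the drug in 2020"          # true paraphrase
contradiction = "the drug was not approved by the fda in 2020"  # inverts the claim

# The contradictory summary shares more surface tokens with the
# reference than the faithful paraphrase does, so it scores HIGHER
# despite reversing the claim's truth value.
print(rouge1_f1(reference, faithful))       # 0.875
print(rouge1_f1(reference, contradiction))  # ~0.947
```

A metric that rewards the contradiction over the faithful paraphrase cannot, by itself, track factual correctness, which is consistent with the misalignment the claim describes.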
Authors
Sources
- Re-evaluating Hallucination Detection in LLMs - arXiv (arxiv.org)
Referenced by nodes (3)
- artificial intelligence concept
- factual correctness concept
- ROUGE concept