Fact — claim — Knowledge Tree

Traditional n-gram overlap measures like ROUGE are limited in their ability to reliably assess factual consistency in AI systems.

Authors

Person: Not available Organization: arXiv
Re-evaluating Hallucination Detection in LLMs - arXiv

Sources

Re-evaluating Hallucination Detection in LLMs - arXiv arxiv.org arXiv via serper

Referenced by nodes (2)

artificial intelligence concept
ROUGE concept