measurement
Evaluation of factual consistency in summaries uses BERT-Precision and FactKB as metrics, and utilizes datasets including CNN-DM and XSUM for summarization, and MemoTrap and NQ-Swap for knowledge conflicts.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (1)
- factual consistency evaluation concept