reference
Conversational QA models are evaluated using fine-tuning on MNLI, SNLI, FEVER, PAWS, ScTail, and VitaminC, while summarisation models are evaluated using fine-tuning on ANLI and XNLI.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (1)
- fever concept