Fact — measurement — Knowledge Tree

Faithfulness between predicted responses and ground-truth knowledge is evaluated with four metrics (Critic, Q², BERT F1, and F1) on datasets including Wizard-of-Wikipedia (WoW), the DSTC9 and DSTC11 extensions of MultiWoZ 2.1, and FaithDial.
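
As an illustration of the simplest of these metrics, the sketch below computes a token-overlap F1 between a predicted response and a ground-truth knowledge snippet. It assumes that "F1" here means unigram-overlap F1 with lowercased whitespace tokenization, which is a common convention in knowledge-grounded dialogue but is not confirmed by the source. The function name `token_f1` is hypothetical.

```python
from collections import Counter


def token_f1(prediction: str, knowledge: str) -> float:
    """Unigram-overlap F1 between a response and a knowledge snippet.

    Assumption: lowercased whitespace tokenization; real evaluations
    often also strip punctuation and articles before comparing.
    """
    pred_tokens = prediction.lower().split()
    know_tokens = knowledge.lower().split()
    if not pred_tokens or not know_tokens:
        return 0.0

    # Multiset intersection counts each shared token at most as often
    # as it appears in both sequences.
    overlap = sum((Counter(pred_tokens) & Counter(know_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)  # share of response tokens grounded in knowledge
    recall = overlap / len(know_tokens)     # share of knowledge tokens covered by the response
    return 2 * precision * recall / (precision + recall)
```

A higher score means the response reuses more of the knowledge wording. Surface overlap alone does not capture paraphrase or contradiction, which is presumably why model-based metrics such as Critic, Q², and BERT F1 are reported alongside it.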

Authors

Person: Not available
Organization: GitHub
EdinburghNLP/awesome-hallucination-detection - GitHub

Sources

EdinburghNLP/awesome-hallucination-detection - GitHub (github.com)

Referenced by nodes (2)

F1 concept
faithfulness concept