reference
The paper 'FaithDial: A Faithful Benchmark for Information-Seeking Dialogue' by Dziri et al. (2022), published in the Transactions of the Association for Computational Linguistics, introduces a benchmark for evaluating the faithfulness of information-seeking dialogue systems.
Sources
- Re-evaluating Hallucination Detection in LLMs (arXiv)