reference
The FEVER benchmark, introduced by Thorne et al. in 2018, uses a Natural Language Inference (NLI) model to evaluate whether a response is supported by, contradicts, or is not addressed by the provided evidence.
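The three-way judgment described above can be sketched as a mapping from NLI labels to verdicts. This is only an illustration under stated assumptions: the `verdict` function, the `NLI_TO_VERDICT` table, and the `toy_nli` stand-in are hypothetical names invented here, and the toy function is a string-matching placeholder, not a real NLI model or the benchmark's own pipeline.

```python
from typing import Callable

# Assumed mapping from generic NLI labels to FEVER-style verdict names.
NLI_TO_VERDICT = {
    "entailment": "SUPPORTS",
    "contradiction": "REFUTES",
    "neutral": "NOT ENOUGH INFO",
}


def verdict(evidence: str, response: str, nli: Callable[[str, str], str]) -> str:
    """Classify a response against evidence with a caller-supplied NLI function."""
    # The evidence plays the role of the premise, the response the hypothesis.
    label = nli(evidence, response)
    return NLI_TO_VERDICT[label]


def toy_nli(premise: str, hypothesis: str) -> str:
    """Hypothetical stand-in for an NLI model, based on exact string matching."""
    if hypothesis == premise:
        return "entailment"
    if hypothesis == "not " + premise:
        return "contradiction"
    return "neutral"


if __name__ == "__main__":
    evidence = "the sky is blue"
    for response in (evidence, "not " + evidence, "grass is green"):
        print(response, "->", verdict(evidence, response, toy_nli))
```

In practice `toy_nli` would be replaced by a call to a trained NLI classifier; keeping the model behind a plain callable means the label mapping can be checked independently of any particular model.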