measurement
Using GraphEval in conjunction with state-of-the-art natural language inference (NLI) models improves balanced accuracy on various hallucination benchmarks compared to using raw NLI models alone.
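
For readers unfamiliar with the metric named in the claim, balanced accuracy is the mean of per-class recall, so a detector cannot score well simply by predicting the majority class. The following is a minimal sketch of that metric only. The function name, the label convention (1 = hallucinated, 0 = consistent), and the toy labels are illustrative assumptions; none of them are GraphEval code or reported results.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall for binary labels.

    Assumed convention (illustrative): 1 = hallucinated, 0 = consistent.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present in y_true")
    # Recall on the positive class and recall on the negative class, averaged.
    return 0.5 * (tp / pos + tn / neg)


# Hypothetical toy labels, not taken from any benchmark.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
always_consistent = [0, 0, 0, 0, 0, 0, 0, 0]  # never flags a hallucination
mixed_detector = [1, 0, 0, 0, 0, 0, 1, 0]     # one hit, one miss, one false alarm

score_trivial = balanced_accuracy(y_true, always_consistent)
score_mixed = balanced_accuracy(y_true, mixed_detector)
```

On this toy data the always-consistent predictor has a plain accuracy of 0.75 but a balanced accuracy of 0.5, which is why the metric suits hallucination benchmarks where the two classes are unevenly represented.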
