claim
Hallucination detection metrics measure either the degree of hallucination in a generated response relative to the given knowledge, or the response's overlap with gold faithful responses. Examples include Critic, Q² (F1 and NLI variants), BERTScore, F1, BLEU, and ROUGE.
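As an illustration of the overlap-based family, here is a minimal sketch of token-level F1 between a generated response and a reference text (a gold response or the given knowledge). The function name `token_f1` is hypothetical, and the lowercasing and whitespace tokenization are simplifying assumptions; published metrics typically apply their own normalization, so scores from this sketch may not match theirs.

```python
from collections import Counter


def token_f1(response: str, reference: str) -> float:
    """Token-level F1 between a response and a reference (illustrative only)."""
    # Assumption: lowercase + whitespace split stands in for real tokenization.
    pred = response.lower().split()
    gold = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred or not gold:
        return float(pred == gold)

    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred) & Counter(gold)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred)  # share of response tokens found in reference
    recall = overlap / len(gold)     # share of reference tokens found in response
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(token_f1("the cat sat on the mat", "the cat sat"))
```

A low score against the given knowledge suggests the response contains tokens the knowledge does not support, but lexical overlap cannot tell a paraphrase from a fabrication, which is one reason NLI-based and model-based metrics appear alongside it in the list above.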
Sources
- EdinburghNLP/awesome-hallucination-detection (GitHub, github.com)
Referenced by nodes (6)
- hallucination detection concept
- ROUGE concept
- BERTScore concept
- BLEU concept
- natural language inference (NLI) concept
- F1 concept