Metrics used to evaluate hallucination rate in conversational settings include BLEU, ROUGE-1, ROUGE-2, and ROUGE-L, all of which score n-gram overlap between generated text and reference text. They are measured across settings such as original text, optimized system messages, full LLM weights, synthetic data, or mixtures of synthetic and reference data.
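
As a rough illustration of how the ROUGE variants named above are computed, the following Python sketch implements ROUGE-1, ROUGE-2, and ROUGE-L as F1 scores. The function names, the lowercase whitespace tokenization, and the example sentences are assumptions made for illustration and do not come from the source repository. Published results usually rely on established packages with their own tokenization and stemming, so scores from this sketch may differ. BLEU is left out to keep the example short.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list (empty Counter if the list is too short)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, cand_total, ref_total):
    """Harmonic mean of precision (overlap/candidate) and recall (overlap/reference)."""
    if overlap == 0 or cand_total == 0 or ref_total == 0:
        return 0.0
    precision = overlap / cand_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(candidate, reference, n):
    """ROUGE-N F1 using clipped n-gram overlap. Tokenization is an assumption."""
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    overlap = sum((cand & ref).values())  # Counter intersection clips repeated n-grams
    return f1(overlap, sum(cand.values()), sum(ref.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence, via row-by-row dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, 1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    """ROUGE-L F1 based on the longest common subsequence of tokens."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    return f1(lcs_length(cand, ref), len(cand), len(ref))


if __name__ == "__main__":
    # Hypothetical example pair, not taken from any benchmark.
    generated = "the cat sat on the mat"
    reference = "the cat is on the mat"
    print("ROUGE-1:", round(rouge_n(generated, reference, 1), 4))
    print("ROUGE-2:", round(rouge_n(generated, reference, 2), 4))
    print("ROUGE-L:", round(rouge_l(generated, reference), 4))
```

For this example pair, five of the six unigrams match, three of the five bigrams match, and the longest common subsequence has five tokens. The scores are high even though the generated sentence changes the verb, which is why overlap metrics are best read as a proxy rather than a direct measure of factual correctness.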

Authors

Person: Not available
Organization: GitHub

Sources

EdinburghNLP/awesome-hallucination-detection - GitHub (github.com)

Referenced by nodes (1)

BLEU concept