reference
Evaluation metrics for hallucination rate in conversational settings include BLEU, ROUGE-1, ROUGE-2, and ROUGE-L, measured across settings such as original text, optimized system messages, full LLM weights, synthetic data, or mixtures of synthetic and reference data.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (1)
- BLEU concept