reference
Hallucination rate in conversational settings is evaluated with overlap-based metrics: BLEU, ROUGE-1, ROUGE-2, and ROUGE-L. Scores are compared across settings such as original text, optimized system messages, full LLM weights, synthetic data, and mixtures of synthetic and reference data.
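
To make the ROUGE variants concrete, here is a minimal sketch of how ROUGE-1, ROUGE-2, and ROUGE-L F1 scores could be computed for one model response against one reference. It assumes lowercased whitespace tokenization and no stemming, which is simpler than what published evaluation toolkits do. The function names (`rouge_n`, `rouge_l`) and the example strings are illustrative and do not come from the source.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, cand_total, ref_total):
    """Harmonic mean of precision (overlap/cand) and recall (overlap/ref)."""
    if overlap == 0 or cand_total == 0 or ref_total == 0:
        return 0.0
    precision = overlap / cand_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(candidate, reference, n):
    """ROUGE-N F1: clipped n-gram overlap between candidate and reference."""
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    overlap = sum((cand & ref).values())  # Counter & takes the minimum counts
    return f1(overlap, sum(cand.values()), sum(ref.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence, by dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, 1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    """ROUGE-L F1: based on the longest common subsequence of tokens."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    return f1(lcs_length(cand, ref), len(cand), len(ref))


if __name__ == "__main__":
    response = "the cat sat on the mat"    # hypothetical model output
    reference = "the cat is on the mat"    # hypothetical reference text
    for name, score in [
        ("ROUGE-1", rouge_n(response, reference, 1)),
        ("ROUGE-2", rouge_n(response, reference, 2)),
        ("ROUGE-L", rouge_l(response, reference)),
    ]:
        print(f"{name}: {score:.3f}")
```

These scores measure only surface overlap with a reference, so a low score is at best a proxy for hallucination and is most informative when the same reference set is used across every setting being compared.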
