Fact — claim — Knowledge Tree

Most hallucination detection methods, excluding the basic Self-Evaluation technique, struggled to provide significant improvements over random guessing when evaluated on the FinanceBench dataset.

Authors

Person: Not available Organization: Cleanlab
Benchmarking Hallucination Detection Methods in RAG - Cleanlab

Sources

Benchmarking Hallucination Detection Methods in RAG - Cleanlab cleanlab.ai Cleanlab via serper

Referenced by nodes (2)

LLM-as-a-judge concept
FinanceBench concept