claim
For a fair comparison, the Cleanlab benchmark fixes the underlying LLM to gpt-4o-mini for all hallucination detection methods.
Authors
Sources
- Benchmarking Hallucination Detection Methods in RAG - Cleanlab cleanlab.ai via serper
Referenced by nodes (2)
- Cleanlab entity
- gpt-4o-mini concept