claim
To keep the comparison fair, the Cleanlab benchmark fixes the underlying LLM to gpt-4o-mini for every hallucination detection method.
