claim
The Cleanlab hallucination detection benchmark evaluates methods across four public Context-Question-Answer datasets spanning different RAG applications.

Authors

Sources

Referenced by nodes (3)