claim
Each entry in the Cleanlab RAG benchmark datasets contains a user query, the retrieved context, an LLM-generated response, and a binary annotation indicating whether that response was correct.
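
The four-part entry structure described in this claim can be sketched as a small record type. This is a minimal illustration only: the class name `RagBenchmarkEntry`, the field names, the `accuracy` helper, and the sample records are all hypothetical and are not the datasets' actual column names or contents.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class RagBenchmarkEntry:
    """One benchmark record. Field names are assumed for illustration."""
    query: str        # the user query
    context: str      # the retrieved context passed to the LLM
    response: str     # the LLM-generated response
    is_correct: bool  # binary annotation: was the response correct?


def accuracy(entries: Sequence[RagBenchmarkEntry]) -> float:
    """Return the fraction of entries annotated as correct."""
    if not entries:
        raise ValueError("entries must be non-empty")
    return sum(e.is_correct for e in entries) / len(entries)


# Made-up records for illustration; not taken from the benchmark.
examples = [
    RagBenchmarkEntry(
        query="What is the capital of France?",
        context="Paris is the capital of France.",
        response="Paris",
        is_correct=True,
    ),
    RagBenchmarkEntry(
        query="What is the capital of France?",
        context="Paris is the capital of France.",
        response="Lyon",
        is_correct=False,
    ),
]
```

Because the annotation is binary, aggregate statistics such as the share of correct responses reduce to a simple mean over `is_correct`, as the hypothetical `accuracy` helper shows.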
