claim
Using an LLM-as-a-judge for RAG scoring provides nuance but introduces non-determinism, scoring variability, orchestration complexity, and cost at scale.
Authors
Sources
- RAG Hallucinations: Retrieval Success ≠ Generation Accuracy www.linkedin.com via serper
Referenced by nodes (3)
- LLM-as-a-judge concept
- cost concept
- RAG evaluation concept