claim
Using an LLM-as-a-judge for RAG scoring provides nuance but introduces non-determinism, scoring variability, orchestration complexity, and cost at scale.

Authors

Sources

Referenced by nodes (3)