claim
Embedding similarity metrics for RAG evaluation are deterministic and cheap but rigid: they reward closeness to the ground-truth answer rather than actual correctness, so a genuinely improved answer can score worse when the ground truth is narrow.
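
A minimal sketch of the failure mode, not taken from the source. It assumes a toy bag-of-words "embedding" (the hypothetical `toy_embed`) in place of a real embedding model, and the example sentences are invented for illustration. Real embedding models are less sensitive to surface wording, but the scoring logic is the same: the metric measures distance to the reference, not correctness.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words token counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def score(answer, reference):
    """Similarity-to-ground-truth score, as an evaluation metric would compute it."""
    return cosine(toy_embed(answer), toy_embed(reference))


# A narrow ground truth (illustrative data, not from the source).
ground_truth = "paris is the capital of france"

exact_answer = "paris is the capital of france"
# Correct and more informative, but worded differently from the reference.
richer_answer = (
    "the capital of france is paris and it is also the largest city in the country"
)
# Factually wrong, but nearly identical to the reference on the surface.
wrong_answer = "lyon is the capital of france"

for name, answer in [
    ("exact", exact_answer),
    ("richer", richer_answer),
    ("wrong", wrong_answer),
]:
    print(f"{name:>6}: {score(answer, ground_truth):.3f}")
```

Under this toy embedding the wrong answer scores about 0.83 and the richer, correct answer about 0.75. Repeated runs give identical scores, which is the determinism the claim refers to. The metric ranks the wrong answer above the improved one because the ground truth is narrow.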

Authors

Sources

Referenced by nodes (2)