claim
SLM-as-a-judge approaches for hallucination detection often fail in complex use cases, particularly when the context and answer are large and involve layers of reasoning.

Authors

Sources

Referenced by nodes (2)