claim
The authors of 'Evaluating Evaluation Metrics — The Mirage of Hallucination Detection' observed that LLM-based evaluation, particularly using GPT-4, yields the best overall results for hallucination detection.

Authors

Sources

Referenced by nodes (2)