claim
General-purpose LLMs such as GPT-4 outperform specialized, medically fine-tuned models on hallucination detection tasks when no extra context is provided.
