perspective
Future iterations of the HalluLens benchmark could be strengthened by including diverse models such as Gemini and incorporating human raters.
