reference
The HallusionBench benchmark evaluates Large Vision-Language Models (LVLMs) such as GPT-4V(Vision), Gemini Pro Vision, Claude 3, and LLaVA-1.5 by emphasizing nuanced understanding and interpretation of visual data.

Authors

Sources

Referenced by nodes (1)