claim
GLSim outperforms competitive baselines across multiple Large Vision-Language Models (LLaVA-1.5, MiniGPT-4, Shikra, InstructBLIP, Qwen2.5-VL) without requiring external supervision or judge models.
