claim
Reinforcement learning provides a more robust defense against modality conflict in Multimodal Large Language Models (MLLMs) than prompt engineering or supervised fine-tuning, by training the model to prioritize visual evidence over misleading textual cues.
