claim
Multi-modal Question Answering (QA) involves answering questions over multi-modal data, with Visual Question Answering (VQA) serving as a typical example.
Sources
- Large Language Models Meet Knowledge Graphs for Question ... arxiv.org
Referenced by nodes (2)
- multi-modal question answering concept
- Visual Question Answering concept