claim
Multi-modal Question Answering (QA) involves performing question answering over data and knowledge that includes multiple modalities, such as text, audio, images, and video.
Authors
Sources
- Large Language Models Meet Knowledge Graphs for Question ... arxiv.org via serper
Referenced by nodes (1)
- multi-modal question answering concept