claim
Multi-modal Question Answering (QA) involves answering questions over multi-modal data, with Visual Question Answering (VQA) serving as a typical example.

