KVQA integrates large language models with multimodal knowledge by using two-stage prompting and a pseudo-siamese graph medium fusion to balance intra-modal and inter-modal reasoning.

