reference
VisDom, introduced by Suri et al. (2024), performs multi-document question answering by fusing multimodal knowledge and applying chain-of-thought (CoT) reasoning.
