claim
Multimodal Knowledge Graph construction aims to integrate heterogeneous modalities, including text, images, audio, and video, into unified, structured representations to enable richer reasoning and cross-modal alignment.
Authors
Sources
- LLM-empowered knowledge graph construction: A survey - arXiv arxiv.org via serper
Referenced by nodes (3)
- multimodal knowledge graphs concept
- images concept
- text concept