Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models
The article discusses a new approach to extending cultural heritage knowledge graphs using language and vision models. The authors introduce a multimodal knowledge graph, WJoconde, which incorporates both textual and image data related to French cultural heritage. They also present a framework for enhancing knowledge graphs through automated data extraction and validation processes.
- ▪The proposed knowledge graph, WJoconde, integrates textual and image information.
- ▪The authors developed three variants of WJoconde to support downstream research.
- ▪A comprehensive benchmark for knowledge graph completion methods was created using their dataset.
Opening excerpt (first ~120 words) tap to expand
Computer Science > Artificial Intelligence arXiv:2605.17669 (cs) [Submitted on 17 May 2026] Title:Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models Authors:Yang Zhang, Nada Mimouni, Jean-Claude Moissinac, Fayçal Hamdi View a PDF of the paper titled Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models, by Yang Zhang and 3 other authors View PDF HTML (experimental) Abstract:The preservation and interpretation of cultural heritage increasingly rely on digital technologies, among which Knowledge Graphs (KGs) stand out for their ability to structure vast amounts of data. However, the construction and expansion of these KGs often face challenges due to the diverse and complex nature of cultural heritage information.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.