procedure
The SAGA system performs deduplication by grouping entities by type and using simple blocking to partition data into smaller buckets, followed by a matching model that computes similarity scores using machine-learning or rule-based methods, and finally utilizing correlation clustering to determine matching entities.
Authors
Sources
- Construction of Knowledge Graphs: State and Challenges - arXiv arxiv.org via serper
Referenced by nodes (3)
- entity resolution concept
- machine learning concept
- SAGA entity