procedure
The SAGA system performs deduplication by grouping entities by type and using simple blocking to partition data into smaller buckets, followed by a matching model that computes similarity scores using machine-learning or rule-based methods, and finally utilizing correlation clustering to determine matching entities.

Authors

Sources

Referenced by nodes (3)