procedure
The VisualSem image cleaning process applies four filters: checking for valid image files, removing duplicated images via SHA1 hashing, using a ResNET-based binary classifier to remove non-photographic images, and leveraging OpenAI’s CLIP to remove images that do not minimally match any of the node glosses.

Authors

Sources

Referenced by nodes (1)