You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -54,7 +54,7 @@ From the authors of [GraphLab](https://github.com/jegonzal/PowerGraph) and [Turi
54
54
*Upcoming new features: image graph search!*
55
55
56
56
57
-
## Results on Key Datasets ([full results here](https://docs.google.com/spreadsheets/d/1wikhI6tWkbX_oofW_OynSelvEbWSeina6-ho3ZsMtX8/edit?usp=sharing))
57
+
## Results on Key Datasets ([full results here](https://bit.ly/3nyQ3ef))
58
58
We have thoroughly tested fastdup across various famous visual datasets. Ranging from pilar Academic datasets to Kaggle competitions. A key finding we have made using FastDup is that there are ~1.2M (!) duplicate images on the ImageNet-21K dataset, out of which 104K pairs belong both to the train and to the val splits (this amounts to 20% of the validation set). This is a new unknown result! Full results are below. * train/val splits are taken from https://github.com/Alibaba-MIIL/ImageNet21 .
0 commit comments