Skip to content

Commit 9cfed95

Browse files
authored
Update README.md
1 parent bd4f173 commit 9cfed95

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,7 @@ From the authors of [GraphLab](https://github.com/jegonzal/PowerGraph) and [Turi
5454
*Upcoming new features: image graph search!*
5555

5656

57-
## Results on Key Datasets
57+
## Results on Key Datasets ([full results here](https://docs.google.com/spreadsheets/d/1wikhI6tWkbX_oofW_OynSelvEbWSeina6-ho3ZsMtX8/edit?usp=sharing))
5858
We have thoroughly tested fastdup across various famous visual datasets. Ranging from pilar Academic datasets to Kaggle competitions. A key finding we have made using FastDup is that there are ~1.2M (!) duplicate images on the ImageNet-21K dataset, out of which 104K pairs belong both to the train and to the val splits (this amounts to 20% of the validation set). This is a new unknown result! Full results are below. * train/val splits are taken from https://github.com/Alibaba-MIIL/ImageNet21 .
5959

6060
|Dataset |Total Images |cost [$]|spot cost [$]|processing [sec]|Identical pairs|Anomalies|

0 commit comments

Comments
 (0)