Skip to content

Commit a80bc1a

Browse files
authored
Update README.md
1 parent 6a8c37d commit a80bc1a

File tree

1 file changed

+0
-1
lines changed

1 file changed

+0
-1
lines changed

README.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@ From the authors of [GraphLab](https://github.com/jegonzal/PowerGraph) and [Turi
2525
## Results on Key Datasets
2626
We have thoroughly tested fastdup across various famous visual datasets. Ranging from pilar Academic datasets to Kaggle competitions. A key finding we have made using FastDup is that there are ~1.2M (!) duplicate images on the ImageNet-21K dataset, out of which 104K pairs belong both to the train and to the val splits (this amounts to 20% of the validation set). This is a new unknown result! Full results are below. * train/val splits are taken from https://github.com/Alibaba-MIIL/ImageNet21 .
2727

28-
### FastDup is FAST
2928
|Dataset |Total Images |cost [$]|spot cost [$]|processing [sec]|Identical pairs|Anomalies|
3029
|-----------------------|---------------|--------|-------------|----------------|---------------|---------|
3130
|[imagenet21k-resized](https://www.image-net.org/challenges/LSVRC/) |11,582,724 |4.98 |1.24 |11,561 |[1,194,059](https://www.databasevisual.com/imagenet-21k-resized-leaks)|[Anomalies](https://www.databasevisual.com/imagenet-21k-anonalies) [Wrong Labels](https://www.databasevisual.com/imagenet-21k-wrong-labels)||

0 commit comments

Comments
 (0)