Skip to content

Commit 7df2327

Browse files
committed
simplify more
1 parent e34eef5 commit 7df2327

File tree

1 file changed

+17
-7
lines changed

1 file changed

+17
-7
lines changed

examples/analyzing-kaggle-datasets.ipynb

Lines changed: 17 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -18,13 +18,7 @@
1818
"[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/visual-layer/fastdup/blob/main/examples/analyzing-kaggle-datasets.ipynb)\n",
1919
"[![Open in Kaggle](https://kaggle.com/static/images/open-in-kaggle.svg)](https://kaggle.com/kernels/welcome?src=https://github.com/visual-layer/fastdup/blob/main/examples/analyzing-kaggle-datasets.ipynb)\n",
2020
"\n",
21-
"This notebook shows how you can use fastdup to analyze any datasets from [Kaggle](https://kaggle.com).\n",
22-
"\n",
23-
"We will analyze an image classification dataset for:\n",
24-
"\n",
25-
"+ Duplicates / near-duplicates.\n",
26-
"+ Outliers.\n",
27-
"+ Wrong labels."
21+
"This notebook shows how you can use [fastdup](https://github.com/visual-layer/fastdup) to analyze any computer vision datasets from [Kaggle](https://kaggle.com)."
2822
]
2923
},
3024
{
@@ -181,6 +175,14 @@
181175
"!unzip -q the-rvlcdip-dataset-test.zip"
182176
]
183177
},
178+
{
179+
"cell_type": "markdown",
180+
"id": "1f8d6b66-3f53-4afb-b040-c5d91a628608",
181+
"metadata": {},
182+
"source": [
183+
"Once completed, we should have a folder with the name `test/` which contains all the images from the dataset."
184+
]
185+
},
184186
{
185187
"cell_type": "markdown",
186188
"id": "41f2abee-1251-4500-8ebf-90c593b6157a",
@@ -246,6 +248,14 @@
246248
"## Run fastdup"
247249
]
248250
},
251+
{
252+
"cell_type": "markdown",
253+
"id": "a10910f4-b772-400b-96b6-f44b62b97fe0",
254+
"metadata": {},
255+
"source": [
256+
"To run fastdup, we only need to point `input_dir` to the folder containing images from the dataset."
257+
]
258+
},
249259
{
250260
"cell_type": "code",
251261
"execution_count": 6,

0 commit comments

Comments
 (0)