|
15 | 15 | "source": [ |
16 | 16 | "# Analyzing Hugging Face Datasets\n", |
17 | 17 | "\n", |
| 18 | + "[](https://colab.research.google.com/github/visual-layer/fastdup/blob/main/examples/analyzing-hf-datasets.ipynb)\n", |
| 19 | + "[](https://kaggle.com/kernels/welcome?src=https://github.com/visual-layer/fastdup/blob/main/examples/analyzing-hf-datasets.ipynb)\n", |
| 20 | + "\n", |
18 | 21 | "This notebook shows how you can use fastdup to analyze any datasets from [Hugging Face Datasets](https://huggingface.co/docs/datasets/index).\n", |
19 | 22 | "\n", |
20 | 23 | "We will analyze an image classification dataset for:\n", |
|
2579 | 2582 | "Try it out and let us know what issues you find.\n", |
2580 | 2583 | "\n", |
2581 | 2584 | "\n", |
2582 | | - "We recommend checking out -\n", |
| 2585 | + "Next, feel free to check out other tutorials -\n", |
2583 | 2586 | "\n", |
2584 | | - "- [**Quick Dataset Analysis**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/quick-dataset-analysis.ipynb) - Learn how to quickly analyze a dataset for potential issues. Identify duplicates, outliers, dark/bright/blurry images, and cluster similar images with only a few lines of code.\n", |
| 2587 | + "+ ⚡ [**Quickstart**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/quick-dataset-analysis.ipynb): Learn how to install fastdup, load a dataset and analyze it for potential issues such as duplicates/near-duplicates, broken images, outliers, dark/bright/blurry images, and view visually similar image clusters. If you're new, start here!\n", |
| 2588 | + "+ 🧹 [**Clean Image Folder**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/cleaning-image-dataset.ipynb): Learn how to analyze and clean a folder of images from potential issues and export a list of problematic files for further action. If you have an unorganized folder of images, this is a good place to start.\n", |
| 2589 | + "+ 🖼 [**Analyze Image Classification Dataset**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/analyzing-image-classification-dataset.ipynb): Learn how to load a labeled image classification dataset and analyze for potential issues. If you have labeled ImageNet-style folder structure, have a go!\n", |
| 2590 | + "+ 🎁 [**Analyze Object Detection Dataset**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/analyzing-object-detection-dataset.ipynb): Learn how to load bounding box annotations for object detection and analyze for potential issues. If you have a COCO-style labeled object detection dataset, give this example a try. " |
| 2591 | + ] |
| 2592 | + }, |
| 2593 | + { |
| 2594 | + "cell_type": "markdown", |
| 2595 | + "id": "08fd287b", |
| 2596 | + "metadata": {}, |
| 2597 | + "source": [ |
2585 | 2598 | "\n", |
2586 | | - "- [**Cleaning Image Dataset**](https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/cleaning-image-dataset.ipynb) - Learn how to clean a dataset from broken images, duplicates, outliers, and identify dark/bright/blurry images.\n", |
| 2599 | + "# VL Profiler\n", |
| 2600 | + "If you prefer a no-code platform to inspect and visualize your dataset, [**try our free cloud product VL Profiler**](https://app.visual-layer.com) - VL Profiler is our first no-code commercial product that lets you visualize and inspect your dataset in your browser. \n", |
2587 | 2601 | "\n", |
2588 | | - "- [**Try our free cloud product VL Profiler**](https://app.visual-layer.com) - VL Profiler is our first no-code commercial product that lets you visualize and inspect your dataset in your browser.\n", |
| 2602 | + "[Sign up](https://app.visual-layer.com) now, it's free.\n", |
2589 | 2603 | "\n", |
2590 | 2604 | "[](https://app.visual-layer.com)\n", |
2591 | 2605 | "\n", |
2592 | | - "As usual, feedback is welcome! Drop by our [Slack channel](https://visualdatabase.slack.com/join/shared_invite/zt-19jaydbjn-lNDEDkgvSI1QwbTXSY6dlA#/shared-invite/email) if you have questions!\n", |
2593 | | - "Happy learning 😀" |
| 2606 | + "As usual, feedback is welcome! \n", |
| 2607 | + "\n", |
| 2608 | + "Questions? Drop by our [Slack channel](https://visualdatabase.slack.com/join/shared_invite/zt-19jaydbjn-lNDEDkgvSI1QwbTXSY6dlA#/shared-invite/email) or open an issue on [GitHub](https://github.com/visual-layer/fastdup/issues)." |
2594 | 2609 | ] |
2595 | 2610 | } |
2596 | 2611 | ], |
|
0 commit comments