Add Support for Loading CSV datasets #1042
Replies: 2 comments 1 reply
-
Hi, I understand the motivation for this task, but I'm not sure about the added advantage of a csv dataloader for the average user. Maybe you could expand the scope a bit and add support for an existing dataset annotation format such as coco. In this case the dataset structure would not be defined by a Some things to consider:
|
Beta Was this translation helpful? Give feedback.
-
Created a PR here: #1050 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
What is the motivation for this task?
Currently, training anomalib on custom dataset requires images to be physically separated into folders. To keep track of data revisions, over time we are forced to keep duplicated copies of images. This leads to high usage of local disk space.
The use case of shuffling and experimenting with a subset of training images will also benefit from this solution.
Describe solution
Supporting data loading through CSVs solves this problem and allow a master copy of images to be stored only once - and use CSVs containing list of paths pointing to these images as an interface to dataset ingestion in anomalib.
CSVs to follow a pre-defined schema, to contain at least:
image_path
- Relative path to image (fromroot
in config)Additional context
(proposal from team 2 for the OSS Hackathon)
Beta Was this translation helpful? Give feedback.
All reactions