[BUG] can't load datasets from online zip folders with data in subdirectories #571

@evangeline-corcoran

Description

Describe the bug

A dataset stored in an online repository as a .zip archive containing multiple sub-directories of data can be added to the scivision catalogue. However, when the load_dataset function is used to retrieve the data, it does not load, seemingly because the sub-directory structure cannot be parsed.

How To Reproduce

  1. Add the dataset to the scivision data catalogue with the following data.yml configuration:

         sources:
           part_1:
             description: Top and side view RGB png images as analysed for 'Integrated Phenomics and Genomics reveals genetic loci associated with inflorescence growth in Brassica napus'. Dataset creator and collected by Kevin Williams, Jo Hepworth, and John Doonan at Aberystwyth University/National Plant Phenomics Centre
             origin:
             driver: intake_xarray.image.ImageSource
             args:
               urlpath: ["zip://Part_1/{plantID}/{date}/*.png::https://research.aber.ac.uk/files/64347697/Part_1.zip"]

  2. Attempt to load the data from the scivision catalogue with the load_dataset function.
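
To help narrow down whether the problem lies in the archive layout or in how the pattern is parsed, here is a minimal standard-library sketch that checks a `{plantID}/{date}` style pattern against the member paths of a zip archive. It does not use scivision or intake, and it is not a description of how either library resolves the urlpath. The helper names (`split_chained_url`, `pattern_to_glob`, `matching_members`) and the plant IDs, dates, and file names in the stand-in archive are hypothetical, invented for illustration.

```python
import fnmatch
import io
import re
import zipfile


def split_chained_url(urlpath):
    """Split 'zip://<inner>::<outer>' into (inner pattern, outer URL)."""
    inner, outer = urlpath.split("::", 1)
    return inner[len("zip://"):], outer


def pattern_to_glob(pattern):
    """Replace every {field} placeholder with a '*' wildcard."""
    return re.sub(r"\{[^{}]*\}", "*", pattern)


def matching_members(zip_bytes, pattern):
    """List archive members whose path matches the placeholder pattern.

    Note: fnmatch lets '*' match '/' as well, so this check is looser
    than a real filesystem glob.
    """
    glob = pattern_to_glob(pattern)
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        return sorted(n for n in zf.namelist() if fnmatch.fnmatchcase(n, glob))


# Build a tiny in-memory stand-in for Part_1.zip (hypothetical contents).
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("Part_1/plant01/2020-01-01/top.png", b"")
    zf.writestr("Part_1/plant02/2020-01-02/side.png", b"")
    zf.writestr("Part_1/readme.txt", b"")

# The urlpath from the data.yml configuration in step 1.
inner, outer = split_chained_url(
    "zip://Part_1/{plantID}/{date}/*.png"
    "::https://research.aber.ac.uk/files/64347697/Part_1.zip"
)
print(pattern_to_glob(inner))
print(matching_members(buf.getvalue(), inner))
```

Running the same `matching_members` check against a downloaded copy of the real archive would show whether any members match the pattern at all. If they do, the failure is more likely in how the placeholder pattern is handled during loading than in the archive layout itself.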

Expected behaviour

Images are found in the sub-directories and loaded into a single dataset that can be used as input to the model.predict() function.

Labels

bug (Something isn't working)
