Skip to content

Include fragment metadata when pickling array #614

@gatesn

Description

@gatesn

I'm curious about the comment in this test that sort of mentions that array data shouldn't be serialized when pickling. Does that mean the contents of the array itself? Or does that also cover fragment metadata?

class PickleTest(DiskTestCase):
# test that DenseArray and View can be pickled for multiprocess use
# note that the current pickling is by URI and attributes (it is
# not, and likely should not be, a way to serialize array data)

It would be neat I think if the pickled array included fragment metadatas so each worker doesn't need to separately download the files from VFS (which may lead to consistency problems?)

Also, is there a reason this is only for Dense arrays currently? Or just because that's the Dask integration that exists? Happy to look at contributing anything along these lines :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions