-
Notifications
You must be signed in to change notification settings - Fork 36
Open
Description
I'm curious about the comment in this test that sort of mentions that array data shouldn't be serialized when pickling. Does that mean the contents of the array itself? Or does that also cover fragment metadata?
TileDB-Py/tiledb/tests/test_libtiledb.py
Lines 3050 to 3053 in 0a150c0
| class PickleTest(DiskTestCase): | |
| # test that DenseArray and View can be pickled for multiprocess use | |
| # note that the current pickling is by URI and attributes (it is | |
| # not, and likely should not be, a way to serialize array data) |
It would be neat I think if the pickled array included fragment metadatas so each worker doesn't need to separately download the files from VFS (which may lead to consistency problems?)
Also, is there a reason this is only for Dense arrays currently? Or just because that's the Dask integration that exists? Happy to look at contributing anything along these lines :)
Metadata
Metadata
Assignees
Labels
No labels