Test speeding up backtest output saving to a single zarr

Currently our backtest scripts output a .nc file for each forecast t0 specified in the backtest range, sometimes that can mean many 1000s of files, I have found that when opening all of these .nc files into a single xarray dataset like this: `xr.open_mfdataset(f"{output_dir}/*.nc", parallel=True)` can sometimes be very slow (even with the parallel=True parameter) when it's a large number of files (in my case this was around ~35000 files), I had some success speeding this up using python multiprocessing after following the advice here https://stackoverflow.com/questions/65587633/ways-to-speed-up-open-mfdataset-in-xarray this issue is to benchmark the different ways of doing this and see how much quicker we can make it 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Test speeding up backtest output saving to a single zarr #512

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Test speeding up backtest output saving to a single zarr #512

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions