Distributed file read #2010

msl3v · 2025-07-23T14:03:43Z

msl3v
Jul 23, 2025
Collaborator

Hi PIO!

I'm developing a parallel read interface for GIS files using GDAL. I've mimicked the way pio_read_darray_nc() behaves for NETCDF4P (let me know if this is not the right approach). What controls the number of procs on which the read occurs? It looks like maxregions; is this correct? Regardless of the file size or other file configuration, the read always happens on one proc (even if num_iotasks > 1). I simply want to test that the parallel read works and that the array is formed correctly.

Thank you.

Answered by jedwards4b

jedwards4b · 2025-07-23T14:27:25Z

jedwards4b
Jul 23, 2025
Maintainer

The number of tasks that participate in the read is controlled by the num_iotasks argument in the call to pio_init.
https://github.com/NCAR/ParallelIO/blob/main/src/clib/pioc.c#L1272
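
To make the role of num_iotasks concrete, here is a minimal Python sketch of how a set of I/O ranks can be chosen from the compute communicator. It assumes the I/O ranks start at a base rank and are spaced by a fixed stride, mirroring the num_iotasks, stride, and base arguments of PIOc_Init_Intracomm. The helper name io_ranks is hypothetical and not part of PIO, and the selection rule is a simplified model rather than the library's code.

```python
def io_ranks(num_comptasks, num_iotasks, stride=1, base=0):
    """Return the ranks that would act as I/O tasks (illustrative model only)."""
    if not 1 <= num_iotasks <= num_comptasks:
        raise ValueError("num_iotasks must be between 1 and num_comptasks")
    # Assumption: I/O ranks start at `base` and are `stride` apart,
    # wrapping around modulo the size of the compute communicator.
    return [(base + i * stride) % num_comptasks for i in range(num_iotasks)]


if __name__ == "__main__":
    # 8 compute tasks, 4 I/O tasks, every second rank starting at rank 0
    print(io_ranks(8, 4, stride=2))
```

Logging the MPI rank inside the read routine and comparing it against a list like this is a quick way to check which tasks actually perform the read.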

If the read happens on only a single task regardless of the value of num_iotasks, this suggests that you are using the box rearranger with a rather small decomposition. maxregions is an internal variable that has nothing to do with the number of I/O tasks; it is related to the fragmentation of the data in memory with respect to the file order. I would also recommend looking into the pnetcdf interface, which is generally faster than that of netcdf4/hdf5.
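
One way to picture the fragmentation that maxregions relates to is to count the contiguous runs in the list of file offsets a task holds, taken in memory order. The sketch below is a one-dimensional illustration of that idea only. It is not PIO's internal algorithm, and count_regions is a hypothetical helper.

```python
def count_regions(file_offsets):
    """Count contiguous runs of file offsets, walking them in memory order."""
    regions = 0
    prev = None
    for off in file_offsets:
        # A new region starts whenever the offset does not directly
        # follow the previous one in file order.
        if prev is None or off != prev + 1:
            regions += 1
        prev = off
    return regions


if __name__ == "__main__":
    print(count_regions([0, 1, 2, 3]))     # one contiguous block
    print(count_regions([0, 1, 4, 5, 8]))  # three separate blocks
```

A task whose data maps to one contiguous block of the file has a single region, while a map that jumps around the file yields many. Neither case changes how many tasks take part in the read.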
