Feature: support multi-dimensional batch calculation[experimental] #1186

TonyXiang8787 · 2025-11-11T09:18:42Z

This PR experiments an idea of multi-dimensional batch calculation in the C-API without introducing breaking changes. The Python API is also adjusted without breaking changes.

Signed-off-by: Tony Xiang <[email protected]>

sonarqubecloud · 2025-11-12T08:26:23Z

Quality Gate passed

Issues
9 New issues
0 Accepted issues

Measures
0 Security Hotspots
88.6% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

TonyXiang8787 · 2025-11-17T09:45:36Z

power_grid_model_c/power_grid_model_c/include/power_grid_model_c/dataset.h

+/**
+ * @brief Create a PGM_MultiDimensionalDataset from multiple PGM_ConstDataset instances
+ *
+ * @param handle
+ * @param const_datasets
+ * @param n_datasets
+ * @return
+ */
+PGM_API PGM_MultiDimensionalDataset*
+PGM_dataset_create_multidimensional_from_const(PGM_Handle* handle, PGM_ConstDataset const** const_datasets,
+                                               PGM_Idx n_datasets);
+
+/**
+ * @brief Get the array pointer from a PGM_MultiDimensionalDataset
+ *
+ * @param handle
+ * @param multidimensional_dataset
+ * @return
+ */
+PGM_API PGM_ConstDataset const*
+PGM_get_array_pointer_from_multidimensional(PGM_Handle* handle,
+                                            PGM_MultiDimensionalDataset const* multidimensional_dataset);
+
+/**
+ * @brief destroy the multidimensional dataset object
+ *
+ * @param multidimensional_dataset
+ */
+PGM_API void PGM_destroy_multidimensional_dataset(PGM_MultiDimensionalDataset* multidimensional_dataset);
+


@mgovers This is one option to pass a MD datasets into the C-API. You need to have additional helper functions to create an array of datasets. We need a new opaque struct.

Another interesting approach would be using chaining. So define a function called dataset_const_chain_another, which store a pointer to the next dimension dataset into this one. Internnaly we can check if the next_dataset pointer is nullptr or not to traverse dimensions. This way is less intuitive, but still used in many other APIs. And you only need one extra function. We don't even need to change the options.

i prefer this separate ND approach, but how would we use ND-datasets, e.g. a 3-dimensional one instead of only 2-dimensional

It is not really a ND, but a list of datasets. The core will make a cross-join ND calculation of combinations in the list.

The questions, do we create separate opaque struct to store an array of datasets? This requires a lot of new functions. Or do we do the chaining, this only needs one new function.

i do believe under the current implementation, sure, but it also restricts us going forward. i feel like if we are going to support 1D and have a separate object for 2D, then we might as well go for ND. otherwise, we'll fall into the same rabbit hole that C++ has been in (watch any video on std::mdspan and you'll know, they are better at explaining what i mean than i am)

The current experiment is about ND, not just 2D.

The ND calculation is created by a list of datasets. Each dataset in the list represents the mutation of that dimension. The core creates a cross-join on the mutations.

mgovers · 2025-11-17T10:09:55Z

power_grid_model_c/power_grid_model/include/power_grid_model/auxiliary/dataset.hpp

        requires(!is_indptr_mutable_v<dataset_type>)
    {
-        assert(0 <= scenario && scenario < batch_size());
+        assert(0 <= scenario && scenario <= batch_size());


i think this will crash because buffer.indptr[scenario + 1] is called. In python, it's also not possible to call [1, 2, 3][3] (IndexOutOfRange)

mgovers · 2025-11-17T10:10:11Z

power_grid_model_c/power_grid_model/include/power_grid_model/auxiliary/dataset.hpp

    }

+    // get slice dataset from batch
+    Dataset get_slice_scenario(Idx begin, Idx end) const


really useful 👍 If we add an additional increment here, we can actually also use this in the dispatching of multithreaded batch calculations

mgovers · 2025-11-17T10:15:08Z

power_grid_model_c/power_grid_model_c/src/forward_declarations.hpp

 using PGM_MutableDataset = power_grid_model::meta_data::Dataset<power_grid_model::mutable_dataset_t>;
 using PGM_WritableDataset = power_grid_model::meta_data::Dataset<power_grid_model::writable_dataset_t>;
 using PGM_DatasetInfo = power_grid_model::meta_data::DatasetInfo;
+using PGM_MultiDimensionalDataset = std::vector<PGM_ConstDataset>;


i feel like this should be able to use arbitrary dimensions

C++23 has std::mdspan, which would probably solve this

mgovers · 2025-11-17T10:16:01Z

power_grid_model_c/power_grid_model_c/include/power_grid_model_c/dataset.h

+ *
+ * @param multidimensional_dataset
+ */
+PGM_API void PGM_destroy_multidimensional_dataset(PGM_MultiDimensionalDataset* multidimensional_dataset);


inconsistent PGM_dataset_create_* vs PGM_destroy_*

mgovers · 2025-11-17T10:19:27Z

power_grid_model_c/power_grid_model_c/include/power_grid_model_c/basics.h

+ * @brief Opaque struct for the multi dimensional dataset class.
+ *
+ */
+typedef struct PGM_MultiDimensionalDataset PGM_MultiDimensionalDataset;


maybe following the conventions of std::mdspan:

Suggested change

typedef struct PGM_MultiDimensionalDataset PGM_MultiDimensionalDataset;

typedef struct PGM_MDDataset PGM_MDDataset;

mgovers · 2025-11-17T10:23:30Z

power_grid_model_c/power_grid_model_c/include/power_grid_model_c/dataset.h

+ */
+PGM_API PGM_ConstDataset const*
+PGM_get_array_pointer_from_multidimensional(PGM_Handle* handle,
+                                            PGM_MultiDimensionalDataset const* multidimensional_dataset);


maybe we can do a multi-indexing approach instead? e.g.

PGM_mddataset_get_data(PGM_Handle* handle, PGM_MDDataset const* mddataset); PGM_mddataset_get_flat_element(PGM_Handle* handle, PGM_MDDataset const* mddataset, PGM_Idx flattened_index); PGM_mddataset_get_scenario(PGM_Handle* handle, PGM_MDDataset const* mddataset, PGM_Idx** multi_index, PGM_Idx n_dimensions);

(we can even omit n_dimensions, although it's probably safer to keep it)

TonyXiang8787 added 17 commits November 10, 2025 10:10

start slice dataset

8085e5b

Signed-off-by: Tony Xiang <[email protected]>

add slice scenario

73d85c5

Signed-off-by: Tony Xiang <[email protected]>

Merge branch 'main' into experimental/multi-dimension-batch

234f38d

batch dimension

be44d80

Signed-off-by: Tony Xiang <[email protected]>

calculation implementation

1fd7ed3

Signed-off-by: Tony Xiang <[email protected]>

error handling still needs to be done

45f4d72

Signed-off-by: Tony Xiang <[email protected]>

error handling

06cb330

Signed-off-by: Tony Xiang <[email protected]>

add batch dimensions

a4fc01e

Signed-off-by: Tony Xiang <[email protected]>

batch dimension

118655f

Signed-off-by: Tony Xiang <[email protected]>

start test

3b41f28

Signed-off-by: Tony Xiang <[email protected]>

start test

d88db97

Signed-off-by: Tony Xiang <[email protected]>

api will not work as intended

1b1190d

Signed-off-by: Tony Xiang <[email protected]>

api will not work as intended

21c31c5

Signed-off-by: Tony Xiang <[email protected]>

adjust md dataset

6e8c081

Signed-off-by: Tony Xiang <[email protected]>

add dataset

0b989b4

Signed-off-by: Tony Xiang <[email protected]>

crash yet

14b4039

Signed-off-by: Tony Xiang <[email protected]>

fix bounds checking

3338881

Signed-off-by: Tony Xiang <[email protected]>

TonyXiang8787 self-assigned this Nov 11, 2025

TonyXiang8787 added feature New feature or request do-not-merge This should not be merged labels Nov 11, 2025

TonyXiang8787 added 10 commits November 11, 2025 10:41

remove span

01c86a9

Signed-off-by: Tony Xiang <[email protected]>

fix clang tidy

7786b0c

Signed-off-by: Tony Xiang <[email protected]>

format|

5a3f394

Signed-off-by: Tony Xiang <[email protected]>

[skip ci] add cfunc in python

2650968

Signed-off-by: Tony Xiang <[email protected]>

force nullptr

9aaa6bd

Signed-off-by: Tony Xiang <[email protected]>

add options

e4aa439

Signed-off-by: Tony Xiang <[email protected]>

proxy for multidimensional in python

237681f

Signed-off-by: Tony Xiang <[email protected]>

modify main calculate input

b602f2e

Signed-off-by: Tony Xiang <[email protected]>

type annotation

3e79237

Signed-off-by: Tony Xiang <[email protected]>

[skip ci] not working yet

8d8d80c

Signed-off-by: Tony Xiang <[email protected]>

fix dimensions

61cfd57

Signed-off-by: Tony Xiang <[email protected]>

TonyXiang8787 changed the title ~~Feature: support multi-dimensional batch calculation from C-API [experimental]~~ Feature: support multi-dimensional batch calculation[experimental] Nov 11, 2025

fix mypy

a1331ba

Signed-off-by: Tony Xiang <[email protected]>

TonyXiang8787 linked an issue Nov 12, 2025 that may be closed by this pull request

[FEATURE] 2-D or cross-join of batch updates #1176

Open

TonyXiang8787 mentioned this pull request Nov 12, 2025

[FEATURE] 2-D or cross-join of batch updates #1176

Open

TonyXiang8787 commented Nov 17, 2025

View reviewed changes

mgovers reviewed Nov 17, 2025

View reviewed changes

	typedef struct PGM_MultiDimensionalDataset PGM_MultiDimensionalDataset;
	typedef struct PGM_MDDataset PGM_MDDataset;

Feature: support multi-dimensional batch calculation[experimental] #1186

Are you sure you want to change the base?

Feature: support multi-dimensional batch calculation[experimental] #1186

Conversation

TonyXiang8787 commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sonarqubecloud bot commented Nov 12, 2025

Quality Gate passed

Uh oh!

TonyXiang8787 Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TonyXiang8787 Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mgovers Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TonyXiang8787 commented Nov 11, 2025 •

edited

Loading

TonyXiang8787 Nov 17, 2025 •

edited

Loading

TonyXiang8787 Nov 17, 2025 •

edited

Loading

mgovers Nov 17, 2025 •

edited

Loading