Ft/ensemble changes #115
base: main
Conversation
… uses target synthetic data.
…rent experimental setups
* Added testing of several targets on multiple GPUs
* Added a comment
challenge_data_path: ${target_model.target_model_directory}/${target_model.target_model_name}/challenge_with_id.csv
challenge_label_path: ${target_model.target_model_directory}/${target_model.target_model_name}/challenge_label.csv
target_attack_artifact_dir: ${base_experiment_dir}/target_${target_model.target_model_id}_attack_artifacts/
This directory was extra and can be removed.
@@ -1,34 +1,36 @@
# Ensemble experiment configuration
The current configuration contains a large number of variables and data paths. While all of them are necessary for the experiment, even detailed comments can’t fully clarify the purpose of each element.
As part of a future refactor, I suggest splitting the configuration into multiple smaller configs, each dedicated to a specific part of the attack (e.g., data‑collection pipeline, training pipeline, testing pipeline). This will introduce some overhead, since users will need to manage several config files and ensure they stay aligned, but the gain in clarity and maintainability is worth it.
I did appreciate the convenience of controlling everything from a single config, as it made running many experiments and adjusting parameters very fast. However, this convenience comes at the cost of readability, which becomes a real issue as other users begin to use it.
Maybe it's worth taking a shot at simply using better variable names within one config. So far, we've tried to align the names with the original attack code, which itself uses vague, non-self-explanatory variable names. If we rename them, the new names should be applied consistently throughout the code.
TL;DR:
The current single config is hard to understand because it mixes many variables and data paths with unclear names inherited from the original attack code. Splitting it into multiple pipeline‑specific configs would improve clarity and maintainability, even if it adds some overhead. Alternatively, improving variable naming within one config could be helpful.
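As a rough sketch of what the split might look like (file names, keys, and values below are hypothetical, not taken from this repo; only the `${...}` interpolation style mirrors the quoted config):

```yaml
# base.yaml (hypothetical): values shared by every pipeline config
base_experiment_dir: ./experiments/ensemble
target_model:
  target_model_directory: ./models
  target_model_name: target_v1

# data_collection.yaml (hypothetical): data-collection pipeline only
challenge_data_path: ${target_model.target_model_directory}/${target_model.target_model_name}/challenge_with_id.csv
challenge_label_path: ${target_model.target_model_directory}/${target_model.target_model_name}/challenge_label.csv

# training.yaml (hypothetical): shadow-model training pipeline only
num_shadow_models: 8
```

Each pipeline would then load only its own file plus the shared base, so a reader of the training config never has to scan past data-collection paths.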
)
population.append(df_real)
population.append(df_real)
This was a bug! Thank you for catching this, Sara!
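A minimal sketch of why the duplicate append matters, using plain lists in place of dataframes (all names here are illustrative, not the repo's actual code): appending the real data twice over-represents real records in the population, which skews anything computed from it.

```python
# Stand-ins for the real and synthetic dataframes (illustrative only).
df_real = ["real_row_1", "real_row_2"]
df_synth = ["synth_row_1"]

# Buggy version: df_real is appended twice, so both entries of the
# population are the same object and real records are double-counted.
population_buggy = []
population_buggy.append(df_real)
population_buggy.append(df_real)  # duplicate append (the bug)

# Fixed version: each data source contributes exactly once.
population_fixed = []
population_fixed.append(df_real)
population_fixed.append(df_synth)

print(population_buggy[0] is population_buggy[1])  # True: same object twice
print(population_fixed[0] is population_fixed[1])  # False
```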
# Load the required dataframes for shadow model training.
# For shadow model training we need master_challenge_train and population data.
# Master challenge is the main training (or fine-tuning) data for the shadow models.
df_master_challenge_train = load_dataframe(
Instead of loading the data here, it is passed to the function.
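A hedged sketch of the refactor described here, with stand-in names (`load_dataframe`, `train_shadow_models_*` are illustrative, not the repo's actual API): the caller loads the dataframe once and passes it in, rather than the function loading it internally.

```python
def load_dataframe(path):
    # Stand-in loader; the real code would read a CSV at `path`.
    return [{"path": path, "row": i} for i in range(3)]

# Before: the function loads its own data, hiding the dependency and
# forcing a re-load in every pipeline that needs the same dataframe.
def train_shadow_models_before(path):
    df_master_challenge_train = load_dataframe(path)
    return len(df_master_challenge_train)

# After: the dataframe is passed in, making the dependency explicit and
# letting several pipelines share one loaded copy.
def train_shadow_models_after(df_master_challenge_train):
    return len(df_master_challenge_train)

df = load_dataframe("challenge_with_id.csv")
print(train_shadow_models_after(df))  # 3
```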
| f"Fine-tuned model {model_id} generated {len(train_result.synthetic_data)} synthetic samples.", | ||
| ) | ||
| attack_data["fine_tuned_results"].append(train_result) | ||
| attack_data["fine_tuned_results"].append(train_result.synthetic_data) |
We only need to save the synthetic data.
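A small sketch of the change, with a hypothetical stand-in for the training result object (the real `train_result` type is not shown in this diff): storing only `synthetic_data` keeps exactly what downstream code uses and avoids holding the rest of the result in memory.

```python
from dataclasses import dataclass, field

@dataclass
class TrainResult:
    # Hypothetical result type: only synthetic_data is needed downstream.
    synthetic_data: list
    model_state: dict = field(default_factory=dict)  # large, unused later

attack_data = {"fine_tuned_results": []}
train_result = TrainResult(
    synthetic_data=["s1", "s2"],
    model_state={"weights": [0.0] * 1000},
)

# Before (dropped): the whole result object, model state included.
# attack_data["fine_tuned_results"].append(train_result)

# After: only the synthetic samples are kept.
attack_data["fine_tuned_results"].append(train_result.synthetic_data)

print(attack_data["fine_tuned_results"][0])  # ['s1', 's2']
```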
PR Type
[Feature | Fix | Documentation | Other ]
Short Description
Clickup Ticket(s): Link(s) if applicable.
Add a short description of what is in this PR.
Tests Added
Describe the tests that have been added to ensure the code's correctness, if applicable.