[API 2]: CFI, PFI, LOCO #372

lionelkusch · 2025-09-02T14:54:39Z

Update the model of CFI, PFI and LOCO for API 2.

codecov · 2025-09-02T15:01:52Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.06%. Comparing base (1f97f5b) to head (eca0c0c).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #372      +/-   ##
==========================================
+ Coverage   98.94%   99.06%   +0.12%     
==========================================
  Files          23       21       -2     
  Lines        1424     1393      -31     
==========================================
- Hits         1409     1380      -29     
+ Misses         15       13       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

jpaillard

It looks good but the diff seems very large for this small change.
Is there a reason for all the other modifications?

lionelkusch · 2025-09-02T15:58:42Z

I reorganize a bit the parameter in the init and move the docstring to the class because in all the other classes, I plan to do this.

By looking into more details, I miss some parts being added. I will add it and ask you to review it after. Sorry for it.

bthirion

This PR is definitely an improvement, thx.

src/hidimstat/base_perturbation.py

src/hidimstat/conditional_feature_importance.py

src/hidimstat/leave_one_covariate_out.py

src/hidimstat/permutation_feature_importance.py

src/hidimstat/base_perturbation.py

src/hidimstat/conditional_feature_importance.py

Co-authored-by: Joseph Paillard <[email protected]>

bthirion

Thx for the progress. Please find a few suggestions enclosed.

src/hidimstat/base_perturbation.py

src/hidimstat/leave_one_covariate_out.py

test/test_permutation_feature_importance.py

src/hidimstat/base_perturbation.py

Co-authored-by: bthirion <[email protected]>

src/hidimstat/_utils/utils.py

jpaillard · 2025-10-23T12:04:07Z

src/hidimstat/_utils/utils.py

+            return partial(
+                nadeau_bengio_ttest,
+                popmean=0,
+                test_frac=0.1 / 0.9,


One solution is to make this function a class function, only called during the .importance(X_test, y_test) and to add a fitted attribute self.n_train_ that is set during the fit.

I prefer to let the user define it for the moment.

It's a possible solution to add a required argument test_frac to the instantiation. But since this comes for free in the fit/importance process, I would suggest that hidimstat takes care of it to avoid mistakes from users and limit the number of required arguments in the initialization of the class.

This is not supported by the other statistical tests.
We should support the different statistical tests or choose one specific and not let the choice of the users.
I don't see the point of the moment to have a different behaviour for this specific test.

It has too. You can see the section inference of the user guide.

When cross-validation (for instance, k-fold) is used to estimate CFI, the loss differences obtained from different folds are not independent. Consequently, performing a simple t-test on the loss differences is not valid. This issue can be addressed by a corrected t-test accounting for this dependence, such as the one proposed in Nadeau and Bengio[3].

src/hidimstat/base_perturbation.py

Co-authored-by: Joseph Paillard <[email protected]>

bthirion

No further comment than those raised by @jpaillard

jpaillard · 2025-10-26T17:10:07Z

Regarding the statistical test, the current issue is that the cross-validation scheme has not yet been implemented. Currently, the statistical test is performed using a single train/test split. In that case, when considering the loss values for each individual sample of the test set, they can be considered as independent and we could use ttest instead of nb-ttest as the default.

The NB-t-test actually needs to be the default when CV is used, and losses over test sets cannot be considered as independent.

The comment regarding the nb-ttest implementation remains valid; however, the test fraction shouldn't be hardcoded. I will follow up on #449

jpaillard · 2025-10-26T19:01:33Z

I made 'ttest' as the default statistical_test for the current implem of CFI, which doesn't use CV.
We can keep the implementation of nb-ttest. I fixed the hardcoded test_frac. 'nb-ttest' will be the default statistical_test for LOCO_CV, CFI_CV ... in Cross Validation #449

Let me know if that's good for you. Sorry for the confusion.

jpaillard · 2025-10-27T10:05:15Z

If my last two comments are OK for you, @bthirion, this is ready to merge.
It's otherwise blocking #449

lionelkusch · 2025-10-27T15:04:12Z

@jpaillard I let you deal with this PR.

jpaillard · 2025-11-06T09:04:34Z

I take the opportunity to rename the function loco --> loco_analysis as discussed in #499

bthirion

Very minor stuff pending. Thx !

src/hidimstat/_utils/utils.py

src/hidimstat/leave_one_covariate_out.py

src/hidimstat/conditional_feature_importance.py

src/hidimstat/leave_one_covariate_out.py

src/hidimstat/permutation_feature_importance.py

Co-authored-by: bthirion <[email protected]>

jpaillard · 2025-11-07T07:41:50Z

Sorry again for the confusion regarding the default test. I explained the choice a few comments above.

Regarding the statistical test, the current issue is that the cross-validation scheme has not yet been implemented. Currently, the statistical test is performed using a single train/test split. In that case, when considering the loss values for each individual sample of the test set, they can be considered as independent and we could use ttest instead of nb-ttest as the default.

The NB-t-test actually needs to be the default when CV is used, and losses over test sets cannot be considered as independent.
...

bthirion · 2025-11-07T22:45:32Z

Sorry again for the confusion regarding the default test. I explained the choice a few comments above.

Regarding the statistical test, the current issue is that the cross-validation scheme has not yet been implemented. Currently, the statistical test is performed using a single train/test split. In that case, when considering the loss values for each individual sample of the test set, they can be considered as independent and we could use ttest instead of nb-ttest as the default.
The NB-t-test actually needs to be the default when CV is used, and losses over test sets cannot be considered as independent.
...

OK, makes sense.

bthirion · 2025-11-07T22:45:47Z

I think it's OK for merging.

lionelkusch added 3 commits September 2, 2025 16:23

New API for CFI, PFI, LOCO

df93c78

fix test for new API

ccb60ed

fix example

7c827ad

lionelkusch added the API 2 Refactoring following the second version of API label Sep 2, 2025

lionelkusch requested review from bthirion and jpaillard September 2, 2025 14:54

add test for new check

82d61e6

jpaillard reviewed Sep 2, 2025

View reviewed changes

lionelkusch added 3 commits September 2, 2025 18:13

add pvalue and fit_importance and function

28593e4

Add new function

cabfb63

fix docstring

7d7fd7d

bthirion reviewed Sep 2, 2025

View reviewed changes

jpaillard reviewed Sep 3, 2025

View reviewed changes

src/hidimstat/conditional_feature_importance.py Outdated Show resolved Hide resolved

jpaillard reviewed Sep 3, 2025

View reviewed changes

src/hidimstat/conditional_feature_importance.py Outdated Show resolved Hide resolved

lionelkusch mentioned this pull request Sep 3, 2025

parallelisation of cross-validation in fit_importance #373

Open

lionelkusch and others added 8 commits September 3, 2025 15:19

Improve cross validation

b958cc7

update docstring

1f97d60

update doctring

db96bb6

fix error

d656f17

fix docstring

0493b6f

Apply suggestions from code review

9c54e1b

Co-authored-by: Joseph Paillard <[email protected]>

Update default

7bf75e4

fix tests

b3cd78a

bthirion reviewed Sep 7, 2025

View reviewed changes

lionelkusch and others added 3 commits September 8, 2025 10:59

Apply suggestions from code review

7825490

Co-authored-by: bthirion <[email protected]>

chnage group by features_groups

084ad24

fix format

7379ec1

jpaillard reviewed Oct 23, 2025

View reviewed changes

Update src/hidimstat/base_perturbation.py

cfc12d0

Co-authored-by: Joseph Paillard <[email protected]>

bthirion reviewed Oct 23, 2025

View reviewed changes

lionelkusch added 3 commits October 24, 2025 17:21

Merge branch 'main' into PR_CFI

7a4f44a

Remove unecessary check

d14b835

update loco

87e2029

make ttest the default without CV

c61bb44

lionelkusch mentioned this pull request Oct 27, 2025

On going Refactoring plan #515

Open

5 tasks

jpaillard added 2 commits October 29, 2025 22:13

Merge branch 'main' into PR_CFI

b0c4ec0

Merge branch 'main' of github.com:mind-inria/hidimstat into PR_CFI

359118a

jpaillard changed the title ~~API 2: CFI, PFI, LOCO~~ [API 2]: CFI, PFI, LOCO Nov 6, 2025

rename functions

1deae93

jpaillard requested a review from bthirion November 6, 2025 09:04

fix import

911f11c

bthirion reviewed Nov 6, 2025

View reviewed changes

jpaillard and others added 2 commits November 7, 2025 08:38

Update src/hidimstat/_utils/utils.py

bc4ee65

Co-authored-by: bthirion <[email protected]>

add test_frac

e93b97f

jpaillard added 2 commits November 7, 2025 10:13

Merge branch 'main' of github.com:mind-inria/hidimstat into PR_CFI

e03266e

init

eca0c0c

jpaillard mentioned this pull request Nov 7, 2025

[MNT] clean-up source imports #525

Merged

jpaillard merged commit acb8e20 into mind-inria:main Nov 8, 2025
24 checks passed

jpaillard deleted the PR_CFI branch November 8, 2025 12:00

[API 2]: CFI, PFI, LOCO #372

[API 2]: CFI, PFI, LOCO #372

Uh oh!

Conversation

lionelkusch commented Sep 2, 2025

Uh oh!

codecov bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jpaillard left a comment

Choose a reason for hiding this comment

Uh oh!

lionelkusch commented Sep 2, 2025

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jpaillard Oct 23, 2025

Choose a reason for hiding this comment

Uh oh!

lionelkusch Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

jpaillard Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

lionelkusch Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

jpaillard Oct 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

jpaillard commented Oct 26, 2025

Uh oh!

jpaillard commented Oct 26, 2025

Uh oh!

jpaillard commented Oct 27, 2025

Uh oh!

lionelkusch commented Oct 27, 2025

Uh oh!

jpaillard commented Nov 6, 2025

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Sep 2, 2025 •

edited

Loading