Binary risk control - implementation batch 1 #735

Valentin-Laurent · 2025-07-29T15:07:33Z

No description provided.

Copilot

Pull Request Overview

This PR implements binary classification risk control in its simplest form, focusing on mono-risk and unidimensional lambda using thresholding on predict_proba. The implementation includes cleanup of existing risk control code and introduction of new binary classification risk control components.

Key changes:

Remove unnecessary components from existing LTT procedure (lambda=None check, p_values output)
Add support for array-based n_obs in p-value calculations for binary classification scenarios
Implement BinaryClassificationRisk class with predefined instances for precision, recall, and accuracy
Introduce BinaryClassificationController for threshold-based binary classification risk control

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
mapie/control_risk/ltt.py	Remove p_values return, lambda=None check, and add array support for n_obs
mapie/control_risk/p_values.py	Add array support for n_obs parameter in Hoeffding-Bentkus p-value computation
mapie/risk_control.py	Add BinaryClassificationRisk class and predefined risk instances
mapie/risk_control_draft.py	Implement BinaryClassificationController with threshold-based risk control
mapie/tests/test_control_risk.py	Update tests for LTT changes and add n_obs array testing
mapie/tests/test_risk_control.py	Add comprehensive tests for BinaryClassificationRisk instances
mapie/init.py	Remove risk_control_draft from exports

Comments suppressed due to low confidence (2)

mapie/tests/test_risk_control.py:848

The test should verify the specific condition that leads to None result (effective_sample_size == 0) rather than using a general elif clause. Consider adding an explicit assertion that effective_sample_func returns 0 when result is None.

    elif result is None:

mapie/risk_control_draft.py:18

The entire BinaryClassificationController class is marked with pragma: no cover, indicating missing test coverage. This class implements core functionality and should have comprehensive unit tests.

class BinaryClassificationController:  # pragma: no cover

mapie/risk_control.py

mapie/risk_control_draft.py

- Use BinaryClassificationRisk to compute risk - Use warning instead of error when risk is not controled. Throw error when predicting - Remove useless check on lambda=None in ltt_procedure - Remove useless p_values from ltt_procedure outputs - Add possibility to pass an array of n_obs to ltt_procedure and subsequent p-values calculations (needed for binary classification)

- Fix bentkus_p_value calculation - Fix and move higher_is_better logic in the same place - Implement unit test for BinaryClassificationRiskControl - Fix parametrizing of existing test

… positive predictions)

codecov-commenter · 2025-08-28T07:52:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (binary-risk-control@5811aa9). Learn more about missing BASE report.

Additional details and impacted files

@@                   Coverage Diff                   @@
##             binary-risk-control      #735   +/-   ##
=======================================================
  Coverage                       ?   100.00%           
=======================================================
  Files                          ?        56           
  Lines                          ?      6205           
  Branches                       ?       355           
=======================================================
  Hits                           ?      6205           
  Misses                         ?         0           
  Partials                       ?         0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…rove theoretical test notebook

Valentin-Laurent force-pushed the binary-risk-control-v1 branch from f883ca9 to 13462fc Compare July 30, 2025 09:03

Valentin-Laurent requested a review from Copilot July 30, 2025 13:40

Copilot AI reviewed Jul 30, 2025

View reviewed changes

mapie/risk_control.py Outdated Show resolved Hide resolved

mapie/risk_control.py Outdated Show resolved Hide resolved

mapie/risk_control_draft.py Show resolved Hide resolved

mapie/risk_control_draft.py Outdated Show resolved Hide resolved

Valentin-Laurent force-pushed the binary-risk-control-v1 branch from 8f2a127 to ed18964 Compare July 31, 2025 12:50

Valentin-Laurent force-pushed the binary-risk-control branch from 37c79dc to 4404ad1 Compare July 31, 2025 12:50

Valentin-Laurent force-pushed the binary-risk-control branch from 4404ad1 to 5811aa9 Compare August 27, 2025 12:55

Valentin-Laurent added 13 commits August 27, 2025 14:55

ENH: implement BinaryClassificationRisk and related instances

91daf3e

ENH: simplify BinaryClassificationRisk API

ded36fe

ENH & MTN & FIX

e8dae57

- Fix bentkus_p_value calculation - Fix and move higher_is_better logic in the same place - Implement unit test for BinaryClassificationRiskControl - Fix parametrizing of existing test

TEST - hoeffdding_bentkus_p_value with n_obs as an array

a580aa4

FIX - linting

9e8b092

ENH - Performance, warning and docstring improvements

5fbc940

FIX - Fix local typing issue, investigate CI typing issues

cc88354

FIX - Continue investigating CI typing issues

feb075d

MTN - remove relative import

bf28de0

ENH & TEST - Handle the case of undefined risk (ex: precision with no…

09e8751

… positive predictions)

MTN - Revert formatting to avoid changes unrelated to current PR

1f18795

MTN - Clarify code

e661adb

Valentin-Laurent force-pushed the binary-risk-control-v1 branch from d08fbea to e661adb Compare August 27, 2025 12:55

Valentin-Laurent added 9 commits August 27, 2025 15:16

TEST - Fix test following handling of undefined risk

f232d5d

FIX - Fix typing issues in Python 3.9, revert CI back to normal

df343ca

WIP - try to fix typing (can't reproduce locally)

8bf31fa

WIP - try to fix typing (can't reproduce locally)

c819bcd

WIP - try to fix typing (can't reproduce locally)

6b4fff5

WIP - try to fix typing (can't reproduce locally)

e428c89

WIP - try to fix typing (can't reproduce locally)

78017be

WIP - try to fix typing (can't reproduce locally)

54aac1e

WIP - try to fix typing (can't reproduce locally)

d8e615f

Valentin-Laurent added 4 commits August 28, 2025 18:12

ENH - Add theoretical validity notebook to documentation

dda38d5

FIX - Fix theoretical validity notebook

42f48b2

FIX - Fix implementation error in BinaryClassificationController, imp…

a3a1ec2

…rove theoretical test notebook

ENH - Final tweaks to theoretical_validity_tests.ipynb

b96d0a1

Valentin-Laurent marked this pull request as ready for review August 29, 2025 15:59

Valentin-Laurent merged commit 2dad52d into binary-risk-control Aug 29, 2025
6 checks passed

Valentin-Laurent deleted the binary-risk-control-v1 branch August 29, 2025 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Binary risk control - implementation batch 1 #735

Binary risk control - implementation batch 1 #735

Uh oh!

Valentin-Laurent commented Jul 29, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Aug 28, 2025

Uh oh!

Uh oh!

Uh oh!

Binary risk control - implementation batch 1 #735

Binary risk control - implementation batch 1 #735

Uh oh!

Conversation

Valentin-Laurent commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Aug 28, 2025

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Valentin-Laurent commented Jul 29, 2025 •

edited

Loading