ENH - Improve theoretical validation notebook #776

Valentin-Laurent · 2025-10-16T13:02:22Z

No description provided.

Copilot

Pull Request Overview

This PR improves the theoretical validation notebook by replacing the previous random classifier with a more realistic logistic (sigmoid-based) classifier. The notebook now tests whether the BinaryClassificationController successfully controls risk under various parameter combinations. Key changes include updating the classifier implementation, refining experimental logic, and revising explanatory text to reflect the new setup.

Key Changes

Replaced RandomClassifier with LogisticClassifier that uses a deterministic sigmoid function
Updated validation logic to use empirical metrics (precision_score, recall_score, accuracy_score) on test data
Changed target levels from [0.1, 0.9] to [0.1, 0.7] to better match the classifier's capabilities

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-16T13:03:25Z

notebooks/risk_control/theoretical_validity_tests.ipynb

+    "    {\"name\": \"accuracy\", \"risk\": accuracy},\n",
+    "]\n",
+    "predict_params =  [np.linspace(0, 0.99, 100), np.array([0.5])]\n",
+    "target_level = [0.1, 0.9]\n",


The target_level list includes 0.9, but this value was removed from the actual experiments shown in the output. The code comment and execution results reflect target levels of [0.1, 0.7]. This inconsistency between the code and its output suggests the value should be 0.7 instead of 0.9.

Suggested change

"target_level = [0.1, 0.9]\n",

"target_level = [0.1, 0.7]\n",

notebooks/risk_control/theoretical_validity_tests.ipynb

Copilot · 2025-10-16T13:03:26Z

notebooks/risk_control/theoretical_validity_tests.ipynb

+    "        # In the following, we check that all the valid thresholds found by LTT actually control the risk.\n",
+    "        # We sample a large test set and estimate the risk for each valid_parameters usiing the logistic classifier.\n",
+    "        X_test, y_test = make_classification(\n",
+    "            n_samples=100,\n",


[nitpick] The test set size is hard-coded to 100 samples. Consider extracting this as a named constant (e.g., TEST_SIZE = 100) at the beginning of the notebook to improve maintainability and make it easier to adjust for different validation scenarios.

Copilot · 2025-10-16T13:03:26Z

notebooks/risk_control/theoretical_validity_tests.ipynb

-    "        local_seed = hash((x, self.seed)) % (2**32)\n",
-    "        rng = np.random.RandomState(local_seed)\n",
-    "        return np.round(rng.rand(), 2)\n",
+    "        \"\"\"Probability of class 1 for input x.\"\"\"\n",


The magic numbers 0.3 and 1.0 defining the probability bounds lack explanation. Add a comment explaining why these specific bounds were chosen for the sigmoid function and their impact on the classifier's behavior.

Suggested change

" \"\"\"Probability of class 1 for input x.\"\"\"\n",

" \"\"\"Probability of class 1 for input x.\"\"\"\n",

" # The probability bounds (inf_=0.3, sup_=1.0) restrict the output of the sigmoid function to [0.3, 1.0].\n",

" # This prevents the classifier from being overly confident (never predicting probabilities below 0.3),\n",

" # which can be useful for risk control and to avoid extreme predictions in theoretical tests.\n",

Co-authored-by: Copilot <[email protected]>

… in theoretical validity tests

hmaissoro added 3 commits October 9, 2025 17:27

ENH: change RandomClassifier to LogisticClassifier

63e04fb

ENH: improve description and comments

2e56238

ENH: change the classifier and update the subsequent part accordingly.

25fc011

Valentin-Laurent requested a review from Copilot October 16, 2025 13:02

Copilot AI reviewed Oct 16, 2025

View reviewed changes

allglc and others added 4 commits October 16, 2025 17:05

Copilot typo

9ce1f73

Co-authored-by: Copilot <[email protected]>

ENH: work in progress change random classifier to logistic classifier…

13c34a9

… in theoretical validity tests

update risk control fails

3d8640a

ENH: update affectation

a66cfdc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH - Improve theoretical validation notebook #776

ENH - Improve theoretical validation notebook #776

Uh oh!

Valentin-Laurent commented Oct 16, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 16, 2025

Uh oh!

Uh oh!

Copilot AI Oct 16, 2025

Uh oh!

Copilot AI Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	"target_level = [0.1, 0.9]\n",
	"target_level = [0.1, 0.7]\n",

-    "        \"\"\"Probability of class 1 for input x.\"\"\"\n",
+    "        \"\"\"Probability of class 1 for input x.\"\"\"\n",
+    "        # The probability bounds (inf_=0.3, sup_=1.0) restrict the output of the sigmoid function to [0.3, 1.0].\n",
+    "        # This prevents the classifier from being overly confident (never predicting probabilities below 0.3),\n",
+    "        # which can be useful for risk control and to avoid extreme predictions in theoretical tests.\n",

Uh oh!

ENH - Improve theoretical validation notebook #776

Are you sure you want to change the base?

ENH - Improve theoretical validation notebook #776

Uh oh!

Conversation

Valentin-Laurent commented Oct 16, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants