Add `EqualizedOddsImprovement` metric #775

fealho · 2025-06-17T21:01:33Z

CU-86b5ayqd6, Resolve #772

sdv-team · 2025-06-17T21:01:38Z

Task linked: CU-86b5ayqd6 SDMetrics - Add a fairness metric that computes Equalized Odds #772

codecov · 2025-06-17T21:07:41Z

Codecov Report

Attention: Patch coverage is 98.89503% with 2 lines in your changes missing coverage. Please review.

Project coverage is 95.73%. Comparing base (48e7ee5) to head (9de8f92).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
sdmetrics/single_table/equalized_odds.py	98.37%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #775      +/-   ##
==========================================
+ Coverage   95.64%   95.73%   +0.09%     
==========================================
  Files         115      117       +2     
  Lines        4590     4736     +146     
==========================================
+ Hits         4390     4534     +144     
- Misses        200      202       +2

Flag	Coverage Δ
integration	`80.91% <93.37%> (+0.54%)`	⬆️
unit	`84.14% <86.74%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

R-Palazzo

This is looking good!

Could you add a integration where the sensitive_column_value is np.nan

tests/unit/single_table/test_equalized_odds.py

R-Palazzo · 2025-06-24T16:39:07Z

sdmetrics/single_table/equalized_odds.py

+        for is_sensitive_group in [True, False]:
+            group_predictions = prediction_binary[sensitive_binary == is_sensitive_group]
+            group_name = 'sensitive' if is_sensitive_group else 'non-sensitive'
+
+            if len(group_predictions) == 0:
+                raise ValueError(f'No data found for {group_name} group.')
+
+            positive_count = group_predictions.sum()
+            negative_count = len(group_predictions) - positive_count
+
+            if positive_count < 5 or negative_count < 5:
+                raise ValueError(
+                    f'Insufficient data for {group_name} group: {positive_count} positive, '
+                    f'{negative_count} negative examples (need ≥5 each).'


Do we need a for loop here since we are counting both positive and negative?

Yes. There are two groups (1) data that matches the sensitive value and (2) data that doesn't. For both cases we need at least 5 True target values and 5 False target values.

R-Palazzo · 2025-06-24T16:40:54Z

sdmetrics/single_table/equalized_odds.py

+        data[sensitive_column_name] = (
+            data[sensitive_column_name] == sensitive_column_value
+        ).astype(int)


Does it work if the sensitive_column_value is np.nan?

A good question. If the column is categorical the nans are just another category, since the column dtype is object, so the user should pass 'nan', 'None', or whatever the string representation they are using for missing values is.

It won't reach the line of code you are referring if the user passes np.nan, since the data validation before this will complain np.nan is not present in the data. I added a test showing this.

If the data is numerical it would indeed crash. I added new logic to handle it both in these lines of code and the validation.

sdmetrics/single_table/equalized_odds.py

R-Palazzo

LGTM!

frances-h

Nice work!

fealho force-pushed the issue-772-equalized-odds branch 3 times, most recently from 85aadc3 to 6e3fbca Compare June 23, 2025 16:07

Add EqualizedOddsImprovement

482328d

fealho force-pushed the issue-772-equalized-odds branch from 6e3fbca to 482328d Compare June 23, 2025 16:46

fealho marked this pull request as ready for review June 23, 2025 16:47

fealho requested a review from a team as a code owner June 23, 2025 16:47

fealho requested review from R-Palazzo and frances-h and removed request for a team June 23, 2025 16:47

Fix validation methods

7a70825

R-Palazzo reviewed Jun 24, 2025

View reviewed changes

fealho requested a review from R-Palazzo June 25, 2025 03:35

Feedback

9de8f92

R-Palazzo approved these changes Jun 26, 2025

View reviewed changes

frances-h approved these changes Jun 26, 2025

View reviewed changes

fealho merged commit ac5450a into main Jun 26, 2025
57 checks passed

fealho deleted the issue-772-equalized-odds branch June 26, 2025 20:17

Add EqualizedOddsImprovement metric #775

Add EqualizedOddsImprovement metric #775

Uh oh!

Conversation

fealho commented Jun 17, 2025

Uh oh!

sdv-team commented Jun 17, 2025

Uh oh!

codecov bot commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

R-Palazzo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

R-Palazzo Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

fealho Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

R-Palazzo Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

fealho Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

R-Palazzo left a comment

Choose a reason for hiding this comment

Uh oh!

frances-h left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add `EqualizedOddsImprovement` metric #775

Add `EqualizedOddsImprovement` metric #775

codecov bot commented Jun 17, 2025 •

edited

Loading

fealho Jun 25, 2025 •

edited

Loading