
Conversation

@qomhmd commented Jan 4, 2025

In the original package, the ROC-AUC score is calculated using y_preds (predicted class labels) instead of y_scores (predicted probabilities or decision scores). This pull request resolves the issue for classifiers.

@qomhmd mentioned this pull request Jan 5, 2025
@shankarpandala (Owner)

Review Status: Needs Investigation

This PR claims to fix the ROC-AUC calculation by using y_score instead of y_pred.

⚠️ Questions:

  1. Can you provide evidence that the current implementation is incorrect?
  2. What specific issue does this fix? (test case, example, or reproducible error)
  3. The PR has +110 -75 changes across 5 files - what else was modified beyond ROC-AUC?

Next Steps:

@qomhmd Please provide:

  1. Minimal reproducible example showing the bug
  2. Explanation of what other changes are in this PR
  3. Unit tests demonstrating the fix
  4. Rebase on current dev branch (PR is 10 months old)

Current ROC-AUC Implementation:

The current code uses roc_auc_score, which accepts:

  • Binary classification: class predictions or probability/decision scores for the positive class
  • Multi-class: requires predict_proba output together with the multi_class parameter

Need to verify if this is actually a bug or if the current implementation is correct for the use case.
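
For reference, here is a rough sketch (toy data, not code from this repository) of what scikit-learn's roc_auc_score expects in each case:

# Illustrative sketch with made-up data; not taken from this repository.
import numpy as np
from sklearn.metrics import roc_auc_score

# Binary: the second argument should be a score for the positive class,
# e.g. predict_proba(X)[:, 1] or decision_function(X) output.
y_true = np.array([0, 0, 1, 1])
y_score = np.array([0.1, 0.4, 0.35, 0.8])
roc_auc_score(y_true, y_score)  # 0.75

# Multi-class: requires an (n_samples, n_classes) probability matrix
# plus the multi_class argument ('ovr' or 'ovo').
y_true_mc = np.array([0, 1, 2, 2])
y_proba_mc = np.array([[0.8, 0.1, 0.1],
                       [0.2, 0.6, 0.2],
                       [0.1, 0.2, 0.7],
                       [0.2, 0.2, 0.6]])
roc_auc_score(y_true_mc, y_proba_mc, multi_class="ovr", average="weighted")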

Decision: On Hold pending author clarification

@shankarpandala (Owner)

Closing as this issue has already been fixed in the current codebase.

Current Implementation (lines 585-610 in Supervised.py)

The ROC-AUC calculation already uses probabilities, not predictions:

# Use predict_proba for ROC-AUC calculation instead of class labels
if hasattr(pipe, "predict_proba"):
    y_pred_proba = pipe.predict_proba(X_test)
    # For binary classification, use probabilities of positive class
    if y_pred_proba.shape[1] == 2:
        roc_auc = roc_auc_score(y_test, y_pred_proba[:, 1])
    else:
        # For multiclass, use one-vs-rest with probabilities
        roc_auc = roc_auc_score(y_test, y_pred_proba, multi_class='ovr', average='weighted')
elif hasattr(pipe, "decision_function"):
    # For models without predict_proba but with decision_function
    y_pred_score = pipe.decision_function(X_test)
    roc_auc = roc_auc_score(y_test, y_pred_score)
else:
    # Fallback to class labels if neither method is available
    roc_auc = roc_auc_score(y_test, y_pred)

This handles:

  • ✅ Binary classification with probabilities
  • ✅ Multiclass with OVR strategy
  • ✅ Models with decision_function
  • ✅ Fallback for models without probability estimates
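
For illustration, a hedged sketch (toy dataset and estimator choices assumed, not repository code) of which branch each kind of model hits:

# Sketch of the predict_proba / decision_function fallback with common
# scikit-learn estimators; illustrative only, not code from this repository.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC
from sklearn.metrics import roc_auc_score

X, y = make_classification(n_samples=200, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# LogisticRegression exposes predict_proba -> first branch.
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
auc_proba = roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1])

# LinearSVC has decision_function but no predict_proba -> second branch.
svm = LinearSVC().fit(X_train, y_train)
auc_margin = roc_auc_score(y_test, svm.decision_function(X_test))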

Conclusion

The fix you proposed has already been implemented (or was implemented independently). The current code correctly uses y_pred_proba instead of y_pred for ROC-AUC calculation.
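
As a minimal reproducible check (a toy sketch on the breast-cancer dataset, not from our test suite), the label-based and probability-based scores really do differ, which is why the probability path matters:

# Toy comparison of ROC-AUC computed from hard labels vs. probabilities;
# illustrative sketch only, not taken from this repository's tests.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = RandomForestClassifier(random_state=42).fit(X_train, y_train)

auc_from_labels = roc_auc_score(y_test, model.predict(X_test))
auc_from_proba = roc_auc_score(y_test, model.predict_proba(X_test)[:, 1])

# The label-based value collapses the ranking information in the
# probabilities, so the two generally differ; the probability-based
# value is the conventional ROC-AUC.
print(auc_from_labels, auc_from_proba)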

Thank you for identifying this issue - it's already resolved in the current version!
