Skip to content
Discussion options

You must be logged in to vote

Each slice should have at least an overall split to specify that it hasn't been sliced.

Also, the method log_quantitative_analysis is used once per metric. So only one pass/fail threshold can be specified for it. The notebooks for the use-cases are a useful demo for the function. Here's a snippet from the heart_failure_prediction.ipynb notebook that shows how the metrics from the evaluator are logged:

 results_female_flat = flatten_results_dict(
    results=results_female,
    model_name=model_name,
)
# ruff: noqa: W505
for name, metric in results_female_flat.items():
    split, name = name.split("/")  # noqa: PLW2901
    descriptions = {
        "BinaryPrecision": "The proportion of pred…

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by rjavadi
Comment options

rjavadi
Mar 15, 2024
Maintainer Author

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested
2 participants