@PGijsbers commented Oct 7, 2020

As far as I could tell, the custom metric could be used for optimization, but its score was never reported.
I set up a separate branch because I wasn't sure whether this was correct (maybe I misinterpreted the documentation in the PR).

In general, we can expect AutoML frameworks to require different signatures for their metric functions. For example, TPOT requires a scikit-learn scorer:

from sklearn.metrics import make_scorer

# Make a custom metric function
def my_custom_accuracy(y_true, y_pred):
    y_true = y_true.ravel()  # amlb provides a (N,1)-vector but we need (N,)
    y_pred = y_pred.ravel()  # probably not required
    score = float(sum(y_pred == y_true)) / len(y_true)
    return score

# Make a custom scorer from the custom metric function
foo = make_scorer(my_custom_accuracy, greater_is_better=True)

However, we want a uniform signature for our own score calculations. So in addition to defining the custom metric for the AutoML framework(s), one needs to be defined for the automl benchmark. This must be a function with the same name as the metric, but with a trailing underscore. The signature is Callable[[amlb.results.Result], float]:

def foo_(result):
    return my_custom_accuracy(result.truth, result.predictions)

This provides the user with result.truth, result.predictions and, if applicable, result.probabilities.
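The same pattern works for metrics that need class probabilities. A minimal sketch (the name logloss_ is illustrative, and it assumes the columns of result.probabilities follow the label order expected by sklearn.metrics.log_loss):

from sklearn.metrics import log_loss

def logloss_(result):
    # result.probabilities holds the predicted class probabilities, shape (N, n_classes).
    # Assumption: its columns match the sorted label order expected by log_loss;
    # pass labels=... explicitly if that assumption does not hold.
    return log_loss(result.truth.ravel(), result.probabilities)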

In general, we can expect AutoML frameworks to require different signatures for their metric functions. We define a new signature specific to the amlb so that we can also report the scores; it is denoted by a trailing underscore ("metric_"). This allows the user to define two methods, e.g. Accuracy and Accuracy_: the former is used by the AutoML framework, the latter by us. No AutoML framework is going to use the format that the amlb uses, so sharing the definition makes no sense until we alter the signature in the amlb.
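Put together, a user extension could define such a pair as follows (a sketch based on the convention above; how the framework consumes Accuracy, e.g. wrapped in a scikit-learn scorer for TPOT, depends on the framework):

from sklearn.metrics import make_scorer

def Accuracy(y_true, y_pred):
    # Framework-facing metric: plain (y_true, y_pred) signature.
    y_true, y_pred = y_true.ravel(), y_pred.ravel()
    return float(sum(y_true == y_pred)) / len(y_true)

# e.g. for TPOT, wrap it as a scikit-learn scorer
accuracy_scorer = make_scorer(Accuracy, greater_is_better=True)

def Accuracy_(result):
    # amlb-facing metric: Callable[[amlb.results.Result], float], used for reporting.
    return Accuracy(result.truth, result.predictions)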
@PGijsbers commented:
This PR should be merged (or rejected) before #141.

@sebhrusen commented Oct 12, 2020

There's a difficulty here. If we take TPOT as an example, the training and the result processing are not done in the same process/venv. This means that if tpot_extensions.py has a dependency on the tpot module, we will be able to load the extension in the training process, but it will fail in the one processing results...

The lookup logic suggested in #141 is then probably too simple.
Maybe we need something like:

extensions_files:
    autosklearn: '{user}/autosklearn_extensions.py'
    autogluon: '{user}/autogluon_extensions.py'
    tpot: ['{user}/tpot_extensions.py', '{user}/sklearn_extensions.py']
    default: '{user}/extensions.py'

Now let's imagine that tpot_extensions.py can only be loaded in the tpot venv. Then we can still provide extensions for the results that only require sklearn: to report the custom metrics in the results, the requirement is that the extension relies only on sklearn, numpy, pandas, scipy..., i.e. the dependencies available in the main process.
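For illustration, a minimal sketch of how such a lookup could resolve the extension files for a framework, falling back to the default entry (hypothetical helper, not existing amlb code; extensions_files is the mapping from the config sketch above):

def resolve_extension_files(framework, extensions_files):
    # Pick the framework-specific entry if present, otherwise fall back to 'default'.
    entry = extensions_files.get(framework, extensions_files.get('default'))
    if entry is None:
        return []
    # Normalize to a list so several extension files can be chained, as for tpot above.
    return entry if isinstance(entry, list) else [entry]

# resolve_extension_files('tpot', config)    -> ['{user}/tpot_extensions.py', '{user}/sklearn_extensions.py']
# resolve_extension_files('some_fw', config) -> ['{user}/extensions.py']  (falls back to 'default')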

I think you can still merge this PR; I will probably take this into account to improve #141 though.

@sebhrusen self-requested a review October 12, 2020 20:16
@sebhrusen left a comment:

Nice addition, thx

@PGijsbers merged commit 12ba65c into improv/extensions Oct 13, 2020
@PGijsbers deleted the improv/more_extensions branch October 13, 2020 08:56