-
Notifications
You must be signed in to change notification settings - Fork 2k
Description
Hey there,
Some production code used for training models broke when no Stacked Ensembles were trained during several AutoML runs using the latest R version of H2O. Upon further inspection, we were able to reproduce the issue with the following code… it appears that specifying {{max_models}} creates a situation that contradicts documentation indicating Stacked Ensembles are always trained as part of AutoML, so reporting that with a reprex here:
{code:r}library(tidyverse)
library(h2o)
data(iris)
h2o.init()
iris_df <- iris %>% as_tibble()
iris_df_h2o <- iris_df %>% as.h2o()
Stacked Ensemble does generate
aml <-
h2o.automl(
y = 'Species',
training_frame = iris_df_h2o,
max_runtime_secs = 60
)
Stacked Ensemble does not generate
aml2 <-
h2o.automl(
y = 'Species',
training_frame = iris_df_h2o,
max_runtime_secs = 60,
max_models = 50,
seed = 1,
exploitation_ratio = .05
)
Stacked Ensemble does not generate
aml3 <-
h2o.automl(
y = 'Species',
training_frame = iris_df_h2o,
max_runtime_secs = 60,
max_models = 50,
seed = 1#,
# exploitation_ratio = .05
)
Stacked Ensemble does not generate
aml4 <-
h2o.automl(
y = 'Species',
training_frame = iris_df_h2o,
max_runtime_secs = 60,
max_models = 50#,
# seed = 1,
# exploitation_ratio = .05
)
Stacked Ensemble DOES generate
aml5 <-
h2o.automl(
y = 'Species',
training_frame = iris_df_h2o,
max_runtime_secs = 60,
# max_models = 50,
seed = 1,
exploitation_ratio = .05
){code}