|
| 1 | + |
| 2 | + |
| 3 | + |
| 4 | +For this engine, there are multiple modes: classification and regression |
| 5 | + |
| 6 | +## Tuning Parameters |
| 7 | + |
| 8 | +This model has no tuning parameters. |
| 9 | + |
| 10 | +Engine arguments of interest |
| 11 | + |
| 12 | +- `max_runtime_secs` and `max_models`: controls the maximum running time and number of models to build in the automatic process. |
| 13 | + |
| 14 | +- `exclude_algos` and `include_algos`: a character vector indicating the excluded or included algorithms during model building. To see a full list of supported models, see the details section in [h2o::h2o.automl()]. |
| 15 | + |
| 16 | +- `validation`: An integer between 0 and 1 specifying the _proportion_ of training data reserved as validation set. This is used by h2o for performance assessment and potential early stopping. |
| 17 | + |
| 18 | +## Translation from parsnip to the original package (regression) |
| 19 | + |
| 20 | +[agua::h2o_train_auto()] is a wrapper around [h2o::h2o.automl()]. |
| 21 | + |
| 22 | + |
| 23 | +```r |
| 24 | +auto_ml() %>% |
| 25 | + set_engine("h2o") %>% |
| 26 | + set_mode("regression") %>% |
| 27 | + translate() |
| 28 | +``` |
| 29 | + |
| 30 | +``` |
| 31 | +## Automatic Machine Learning Model Specification (regression) |
| 32 | +## |
| 33 | +## Computational engine: h2o |
| 34 | +## |
| 35 | +## Model fit template: |
| 36 | +## agua::h2o_train_auto(x = missing_arg(), y = missing_arg(), weights = missing_arg(), |
| 37 | +## validation_frame = missing_arg(), verbosity = NULL) |
| 38 | +``` |
| 39 | + |
| 40 | + |
| 41 | +## Translation from parsnip to the original package (classification) |
| 42 | + |
| 43 | + |
| 44 | +```r |
| 45 | +auto_ml() %>% |
| 46 | + set_engine("h2o") %>% |
| 47 | + set_mode("classification") %>% |
| 48 | + translate() |
| 49 | +``` |
| 50 | + |
| 51 | +``` |
| 52 | +## Automatic Machine Learning Model Specification (classification) |
| 53 | +## |
| 54 | +## Computational engine: h2o |
| 55 | +## |
| 56 | +## Model fit template: |
| 57 | +## agua::h2o_train_auto(x = missing_arg(), y = missing_arg(), weights = missing_arg(), |
| 58 | +## validation_frame = missing_arg(), verbosity = NULL) |
| 59 | +``` |
| 60 | + |
| 61 | +## Preprocessing requirements |
| 62 | + |
| 63 | + |
| 64 | +Factor/categorical predictors need to be converted to numeric values (e.g., dummy or indicator variables) for this engine. When using the formula method via \\code{\\link[=fit.model_spec]{fit()}}, parsnip will convert factor columns to indicators. |
| 65 | + |
| 66 | +## Initializing h2o |
| 67 | + |
| 68 | + |
| 69 | +To use the h2o engine with tidymodels, please run `h2o::h2o.init()` first. By default, This connects R to the local h2o server. This needs to be done in every new R session. You can also connect to a remote h2o server with an IP address, for more details see [h2o::h2o.init()]. |
| 70 | + |
| 71 | +You can control the number of threads in the thread pool used by h2o with the `nthreads` argument. By default, it uses all CPUs on the host. This is different from the usual parallel processing mechanism in tidymodels for tuning, while tidymodels parallelizes over resamples, h2o parallelizes over hyperparameter combinations for a given resample. |
| 72 | + |
| 73 | +h2o will automatically shut down the local h2o instance started by R when R is terminated. To manually stop the h2o server, run `h2o::h2o.shutdown()`. |
0 commit comments