Releases: alteryx/evalml
Releases · alteryx/evalml
v0.31.0
v0.31.0 Aug. 19, 2021
Enhancements
- Updated the high variance check in AutoMLSearch to be robust to a variety of objectives and cv scores #2622
- Use Woodwork's outlier detection for the
OutliersDataCheck#2637 - Added ability to utilize instantiated components when creating a pipeline #2643
- Sped up the all Nan and unknown check in
infer_feature_types#2661
Fixes
Changes
- Deleted
_put_into_original_orderhelper function #2639 - Refactored time series pipeline code using a time series pipeline base class #2649
- Renamed
dask_teststoparallel_tests#2657 - Removed commented out code in
pipeline_meta.py#2659
Documentation Changes
- Add complete install command to README and Install section #2627
Testing Changes
Breaking Changes
TimeSeriesRegressionPipelineno longer inherits fromTimeSeriesRegressionPipeline#2649
v0.30.2
v0.30.2 Aug. 16, 2021
Fixes
- Updated changelog and version numbers to match the release. Release 0.30.1 was released erroneously without a change to the version numbers. 0.30.2 replaces it.
v0.30.1
v0.30.1 Aug. 13, 2021
Enhancements
- Added
DatetimeFormatDataCheckfor time series problems #2603 - Added
ProphetRegressorto estimators #2242 - Updated
ComponentGraphto handle not calling samplers' transform during predict, and updated samplers' transform methods s.t.fit_transformis equivalent tofit(X, y).transform(X, y)#2583 - Updated
ComponentGraph_validate_component_dictlogic to be stricter about input values #2599 - Patched bug in
xgboostestimators where predicting on a feature matrix of only booleans would throw an exception. #2602 - Updated
ARIMARegressorto use relative forecasting to predict values #2613 - Updated to support Woodwork 0.5.1 #2610
Fixes
- Updated
get_best_sampler_for_datato consider all non-numeric datatypes as categorical for SMOTE #2590 - Fixed inconsistent test results from
TargetDistributionDataCheck#2608 - Adopted vectorized pd.NA checking for Woodwork 0.5.1 support #2626
Changes
- Renamed SMOTE samplers to SMOTE oversampler #2595
- Changed
partial_dependenceandgraph_partial_dependenceto raise aPartialDependenceErrorinstead ofValueError. This is not a breaking change becausePartialDependenceErroris a subclass ofValueError#2604 - Cleaned up code duplication in
ComponentGraph#2612
Documentation Changes
- To avoid local docs build error, only add warning disable and download headers on ReadTheDocs builds, not locally #2617
Testing Changes
- Changed the lint CI job to only check against python 3.9 via the
-tflag #2586 - Installed Prophet in linux nightlies test and fixed
test_all_components#2598 - Refactored and fixed all
make_pipelinetests to assert correct order and address new Woodwork Unknown type inference #2572 - Removed
component_graphsas a global variable intest_component_graphs.py#2609
Breaking Changes
- Renamed SMOTE samplers to SMOTE oversampler. Please use
SMOTEOversampler,SMOTENCOversampler,SMOTENOversamplerinstead ofSMOTESampler,SMOTENCSampler, andSMOTENSampler#2595
v0.30.0
v0.30.0 Aug. 4, 2021
Enhancements
- Added
LogTransformerandTargetDistributionDataCheck#2487 - Issue a warning to users when a pipeline parameter passed in isn't used in the pipeline #2564
- Added Gini coefficient as an objective #2544
- Added
reprtoComponentGraph#2565 - Added components to extract features from
URLandEmailAddressLogical Types #2550 - Added support for
NaNvalues inTextFeaturizer#2532 - Added
SelectByTypetransformer #2531 - Added separate thresholds for percent null rows and columns in
HighlyNullDataCheck#2562 - Added support for
NaNnatural language values #2577
Fixes
- Raised error message for types
URL,NaturalLanguage, andEmailAddressinpartial_dependence#2573
Changes
- Updated
PipelineBaseimplementation for creating pipelines from a list of components #2549 - Moved
get_hyperparameter_rangestoPipelineBaseclass from automl/utils module #2546 - Renamed
ComponentGraph'sget_parentstoget_inputs#2540 - Removed
ComponentGraph.linearized_component_graphandComponentGraph.from_list#2556 - Updated
ComponentGraphto enforce requiring.xand.yinputs for each component in the graph #2563
Documentation Changes
- Added documentation for
DaskEngineandCFEngineparallel engines #2560 - Improved detail of
TextFeaturizerdocstring and tutorial #2568
Testing Changes
- Added test that makes sure
split_datadoes not shuffle for time series problems #2552
Breaking Changes
- Moved
get_hyperparameter_rangestoPipelineBaseclass from automl/utils module #2546 - Renamed
ComponentGraph'sget_parentstoget_inputs#2540 - Removed
ComponentGraph.linearized_component_graphandComponentGraph.from_list#2556 - Updated
ComponentGraphto enforce requiring.xand.yinputs for each component in the graph #2563
v0.29.0
v0.29.0 Jul. 22, 2021
Enhancements
- Updated 1-way partial dependence support for datetime features #2454
- Added details on how to fix error caused by broken ww schema #2466
- Added ability to use built-in pickle for saving AutoMLSearch #2463
- Updated our components and component graphs to use latest features of ww 0.4.1, e.g.
concat_columnsand drop in-place. #2465 - Added new, concurrent.futures based engine for parallel AutoML #2506
- Added support for new Woodwork
Unknowntype in AutoMLSearch #2477 - Updated our components with an attribute that describes if they modify features or targets and can be used in list API for pipeline initialization #2504
- Updated
ComponentGraphto accept X and y as inputs #2507 - Removed unused
TARGET_BINARY_INVALID_VALUESfromDataCheckMessageCodeenum and fixed formatting of objective documentation #2520
Fixes
- Fixed
FraudCostobjective and reverted threshold optimization method for binary classification toGolden#2450 - Added custom exception message for partial dependence on features with scales that are too small #2455
- Ensures the typing for Ordinal and Datetime ltypes are passed through _retain_custom_types_and_initalize_woodwork #2461
- Updated to work with Pandas 1.3.0 #2442
- Updated to work with sktime 0.7.0 #2499
Changes
- Updated XGBoost dependency to
>=1.4.2#2484, #2498 - Added a
DeprecationWarningabout deprecating the list API forComponentGraph#2488 - Updated
make_pipelinefor AutoML to create dictionaries, not lists, to initialize pipelines #2504 - No longer installing graphviz on windows in our CI pipelines because release 0.17 breaks windows 3.7 #2516
Documentation Changes
- Moved docstrings from
__init__to class pages, added missing docstrings for missing classes, and updated missing default values #2452 - Build documentation with sphinx-autoapi #2458
- Change
autoapi_ignoreto only ignore files inevalml/tests/*#2530
Testing Changes
- Fixed flaky dask tests #2471
- Removed shellcheck action from
build_conda_pkgaction #2514 - Added a tmp_dir fixture that deletes its contents after tests run #2505
- Added a test that makes sure all pipelines in
AutoMLSearchget the same data splits #2513 - Condensed warning output in test logs #2521
Breaking Changes
NaNvalues in theNatural Languagetype are no longer supported by the Imputer with the pandas upgrade. #2477
v0.28.0
v0.28.0 Jul. 2, 2021
Enhancements
- Added support for showing a Individual Conditional Expectations plot when graphing Partial Dependence #2386
- Exposed
thread_countfor Catboost estimators asn_jobsparameter #2410 - Updated Objectives API to allow for sample weighting #2433
Fixes
- Deleted unreachable line from
IterativeAlgorithm#2464
Changes
- Pinned Woodwork version between 0.4.1 and 0.4.2 #2460
- Updated psutils minimum version in requirements #2438
- Updated
log_error_callbackto not include filepath in logged message #2429
Documentation Changes
Testing Changes
v0.27.0
v0.27.0 Jun. 23, 2021
Enhancements
- Adds force plots for prediction explanations #2157
- Removed self-reference from
AutoMLSearch#2304 - Added support for nonlinear pipelines for
generate_pipeline_code#2332 - Added
inverse_transformmethod to pipelines #2256 - Add optional automatic update checker #2350
- Added
search_ordertoAutoMLSearch'srankingsandfull_rankingstables #2345 - Updated threshold optimization method for binary classification #2315
- Updated demos to pull data from S3 instead of including demo data in package #2387
- Upgrade woodwork version to v0.4.1 #2379
Fixes
- Preserve user-specified woodwork types throughout pipeline fit/predict #2297
- Fixed
ComponentGraphappending target tofinal_component_featuresif there is a component that returns both X and y #2358 - Fixed partial dependence graph method failing on multiclass problems when the class labels are numeric #2372
- Added
thresholding_objectiveargument toAutoMLSearchfor binary classification problems #2320 - Added change for
k_neighborsparameter in SMOTE Oversamplers to automatically handle small samples #2375 - Changed naming for
Logistic Regression Classifierfile #2399 - Pinned pytest-timeout to fix minimum dependence checker #2425
- Replaced
Elastic Net Classifierbase class withLogistsic Regressionto avoidNaNoutputs #2420
Changes
- Cleaned up
PipelineBase'scomponent_graphand_component_graphattributes. UpdatedPipelineBase__repr__and added__eq__forComponentGraph#2332 - Added and applied
blacklinting package to the EvalML repo in place ofautopep8#2306 - Separated
custom_hyperparametersfrom pipelines and added them as an argument toAutoMLSearch#2317 - Replaced
allowed_pipelineswithallowed_component_graphs#2364 - Removed private method
_compute_features_during_fitfromPipelineBase#2359 - Updated
compute_orderinComponentGraphto be a read-only property #2408 - Unpinned PyZMQ version in requirements.txt #2389
- Uncapping LightGBM version in requirements.txt #2405
- Updated minimum version of plotly #2415
- Removed
SensitivityLowAlertobjective from core objectives #2418
Documentation Changes
- Fixed lead scoring weights in the demos documentation #2315
- Fixed start page code and description dataset naming discrepancy #2370
Testing Changes
- Update minimum unit tests to run on all pull requests #2314
- Pass token to authorize uploading of codecov reports #2344
- Add
pytest-timeout. All tests that run longer than 6 minutes will fail. #2374 - Separated the dask tests out into separate github action jobs to isolate dask failures. #2376
- Refactored dask tests #2377
- Added the combined dask/non-dask unit tests back and renamed the dask only unit tests. #2382
- Sped up unit tests and split into separate jobs #2365
- Change CI job names, run lint for python 3.9, run nightlies on python 3.8 at 3am EST #2395 #2398
- Set fail-fast to false for CI jobs that run for PRs #2402
Breaking Changes
AutoMLSearchwill acceptallowed_component_graphsinstead ofallowed_pipelines#2364- Removed
PipelineBase's_component_graphattribute. UpdatedPipelineBase__repr__and added__eq__forComponentGraph#2332 pipeline_parameterswill no longer acceptskopt.spacevariables since hyperparameter ranges will now be specified throughcustom_hyperparameters#2317
v0.25.0
v0.25.0 Jun. 2, 2021
Enhancements
- Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
- Added a new callback parameter for
explain_predictions_best_worst#2308
Fixes
Changes
- Deleted the
return_pandasflag from our demo data loaders #2181
Documentation Changes
Testing Changes
- Ignoring
test_saving_png_filewhile building conda package #2323
Breaking Changes
v0.24.2
v0.24.2 May. 24, 2021
Enhancements
- Added oversamplers to AutoMLSearch #2213 #2286
- Added dictionary input functionality for
Undersamplercomponent #2271 - Changed the default parameter values for
Elastic Net ClassifierandElastic Net Regressor#2269
Fixes
- Set default
n_jobsto 1 forStackedEnsembleClassifierandStackedEnsembleRegressoruntil fix for text-based parallelism in sklearn stacking can be found #2295
Changes
- Updated
start_iteration_callbackto accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290
Documentation Changes
Testing Changes
- Use codecov action to update coverage reports #2238
- Removed MarkupSafe dependency version pin from requirements.txt and moved instead into RTD docs build CI #2261
Breaking Changes
- Updated
start_iteration_callbackto accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290
v0.24.1
v0.24.1 May. 17, 2021
Enhancements
- Integrated
ARIMARegressorinto AutoML #2009 - Updated
HighlyNullDataCheckto also perform a null row check #2222 - Set
max_depthto 1 in calls to featuretools dfs #2231
Fixes
- Removed data splitter sampler calls during training #2253
- Set minimum required version for for pyzmq, colorama, and docutils #2254
- Changed BaseSampler to return None instead of y #2272
Changes
- Updated pipeline
repr()andgenerate_pipeline_codeto return pipeline instances without generating custom pipeline class #2227
Documentation Changes
- Capped Sphinx version under 4.0.0 #2244
Testing Changes
- Change number of cores for pytest from 4 to 2 #2266
- Add minimum dependency checker to generate minimum requirement files #2267
(evalml_dev) RM-MB-151:evalml karsten.chu$ tools/format_release_notes.sh
v0.24.1 May. 17, 2021
Enhancements
- Integrated
ARIMARegressorinto AutoML #2009 - Updated
HighlyNullDataCheckto also perform a null row check #2222 - Set
max_depthto 1 in calls to featuretools dfs #2231
Fixes
- Removed data splitter sampler calls during training #2253
- Set minimum required version for for pyzmq, colorama, and docutils #2254
- Changed BaseSampler to return None instead of y #2272
Changes
- Updated pipeline
repr()andgenerate_pipeline_codeto return pipeline instances without generating custom pipeline class #2227
Documentation Changes
- Capped Sphinx version under 4.0.0 #2244