Skip to content

Commit c001e1c

Browse files
author
Rahul Iyer
committed
Build: Release notes for v1.7
1 parent 00a42b5 commit c001e1c

File tree

1 file changed

+54
-0
lines changed

1 file changed

+54
-0
lines changed

ReleaseNotes.txt

Lines changed: 54 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,60 @@ A complete list of changes for each release can be obtained by viewing the git
88
commit history located at https://github.com/madlib/madlib/commits/master.
99

1010
Current list of bugs and issues can be found at http://jira.madlib.net.
11+
--------------------------------------------------------------------------------
12+
MADlib v1.7
13+
14+
Release Date: 2014-December-31
15+
16+
New features:
17+
* Generalized Linear Model:
18+
- Added a new generic module for GLM functions that allow for response
19+
variables that have arbitrary distributions (rather than simply
20+
Gaussian distributions), and for an arbitrary function of the response
21+
variable (the link function) to vary linearly with the predicted values
22+
(rather than assuming that the response itself must vary linearly).
23+
- Available distribution families: gaussian (link functions: identity,
24+
inverse and log), binomial (link functions: probit and logit),
25+
poisson (link functions: log, identity and square-root), gamma (link
26+
functions: inverse, identity and log) and inverse gaussian (link functions:
27+
square-inverse, inverse, identity and log).
28+
- Deprecated 'mlogregr_train' in favor of 'multinom' available as part of
29+
the new GLM functionality.
30+
- Added a new 'ordinal' function for ordered logit and probit regression.
31+
* Decision Tree: Reimplemented the decision tree module which includes following
32+
changes:
33+
- Improved usability due to a new interface.
34+
- Performance enhancements upto 40 times faster than the old interface.
35+
- Additional features like pruning methods, surrogate variables for
36+
NULL handling, cross validation, and various new tree tuning parameters.
37+
- Addition of a new display function to visualize the trained tree and new
38+
prediction function for scoring of new datasets.
39+
* Random Forest: Reimplemented the random forest module which includes following
40+
changes:
41+
- New random forest module based on the new decision tree module.
42+
- Better variable importance metrics and ability to explore each tree
43+
in the forest independently.
44+
- Ability to get class probabilities of all classes and not just the max
45+
class during prediction.
46+
- Improved visualization with export capabilities using Graphviz dot format.
47+
* PMML:
48+
- Upgraded compatible PMML version to 4.1.
49+
- Moved PMML export out of early stage development with new functionality
50+
available to export GLM, decision tree, and random forest models.
51+
* Updated Eigen from 3.1.2 to 3.2.2.
52+
* Updated PyXB from 1.2.3 to 1.2.4.
53+
* Added finer granularity control for running specific install-check tests.
54+
55+
Bug fixes:
56+
- Fixed bug in K-means allowing use of user-defined metric functions
57+
(MADLIB-874, MADLIB-875).
58+
- Fixed issues related to header files included in the build system
59+
(MADLIB-855, MADLIB-879, MADLIB-884).
60+
61+
Known issues:
62+
- Performance for decision tree with cross-validation is poor on a HAWQ
63+
multi-node system.
64+
1165
--------------------------------------------------------------------------------
1266
MADlib v1.6
1367

0 commit comments

Comments
 (0)