Composite option, dataString compression, and correct splits percentage#40
Composite option, dataString compression, and correct splits percentage#40pjaselin wants to merge 19 commits intotopepo:masterfrom
Conversation
added committee and neighbor options
returned the neighbors parameter to the predict method
|
Also, giving the composite option should help accelerate prediction times for cases like #28 |
|
Thanks for doing this; it's great to have an outside contribute. I've been looking at it for a few days and I'd like to tweak the ui. It's a little awkward to have an argument that could be logical or character. How about we
I think that would get us to the same place. Also, the current specification breaks a lot of existing analyses (in books, vignettes, and so on). I'd like it to be backward compatible and defaulting to |
|
Hi @topepo, I really appreciate your feedback and ideas here. Also thank you for allowing me to contribute! This is a much cleaner UI and I think I'll modify my Python implementation to match it.
Also definitely please make sure this all passes your tests beyond mine! |
fixed vignettes
removed model from cv output
Hi @topepo! I've been working on a port of your code into Python (I believe Kirk mentioned that) over here: https://github.com/pjaselin/Cubist. Thank you so much for all the work you are your colleagues have done on this!
Some improvements/fixes I've made here:
Cubist/R/cubist.R
Line 212 in 548ccd7
so I came up with a way to get the right comparison operator based on the model. (This is probably the one definite fix here)
Let me know if you'd like to break this apart and I'd be happy to take feedback!