Commit 653b419
glum v3.0 (#677)
* Make tests green with densematrix-refactor branch
* Remove most Matrixbase subclass checks
* Simplify _group_sum
* Pre-commit autoupdate (#672)
* Use boa in CI. (#673)
* Fix covariance matrix mutating feature names (#671)
* Do not use _set_up_... in covariance_matrix
* Add changelog entry
* Add the option to store the covariance matrix to avoid recomputing it (#661)
* Add option to store covariance matrix during fit
* Fix fitting with variance matrix estimation
`.covariance_matrix()` expects X and weights in a different format than
what we have at the end of `.fit().
* Store covariance matrix after estimation
* Handle the alpha_search and glm_cv cases
* Propagate covariance parameters
* Add changelog
* Slightly more lenient tests
* Pre-commit autoupdate (#676)
Co-authored-by: quant-ranger[bot] <132915763+quant-ranger[bot]@users.noreply.github.com>
* Fix covariance_matrix dtypes
* Make CI use pre-release tabmat
* Column names à la Tabmat #278 (#678)
* Delegate column naming to tabmat
* Add tests
* More tests
* Test for dropping complete categories
* Add docstrings for new argument
* Add changelog entry
* Convert to pandas at the correct place
* Reorganize converting from pandas
* Remove xfail from test
* Formula interface (#670)
* Add formulaic to dependencies
* Add function for transforming the formula
* Add tests
* First draft of glum formula interface
* Fixes and tests
* Handle intercept correctly
* Add formula functionality to glm_cv
* Variables from local context
* Test predict with formulas
* Add formula tutorial
* Fix tutorial
* Reformat tutorial
* Improve function signatures adn docstrings
* Handle two-sided formulas in covariance_matrix
* Make mypy happy about module names
* Matthias' suggestions
* Improve tutorial
* Improve tutorial
* Formula- and term-based Wald-tests (#689)
* Add formulaic to dependencies
* Add function for transforming the formula
* Add tests
* First draft of glum formula interface
* Fixes and tests
* Handle intercept correctly
* Add formula functionality to glm_cv
* Variables from local context
* Test predict with formulas
* Add formula tutorial
* Fix tutorial
* Reformat tutorial
* Improve function signatures adn docstrings
* Handle two-sided formulas in covariance_matrix
* Make mypy happy about module names
* Matthias' suggestions
* Add back term-based Wald-tests
* Tests for term names
* Add formula-based Wald-test
* Tests for formula-based Wald-test
* Add changelog
* Fix exception message
* Additional test case
* make docstrings clearer in the case of terms
* Support for missing values in categorical columns (#684)
* Delegate column naming to tabmat
* Add tests
* More tests
* Test for dropping complete categories
* Add docstrings for new argument
* Add changelog entry
* Convert to pandas at the correct place
* Reorganize converting from pandas
* Remove xfail from test
* Implement missing categorical support
* Add test
* Solve adding missing category when predicting
* Apply Matthias' suggestions
* Add changelog entry
* Fix formula context (#691)
* Make tests fail
* Propagate context through methods
* pyupgrade
* ensure_full_rank != drop_first
* fix
* move feature name assignment to right spot
* fix
* remove blank line
* bump minimum formulaic version (stateful transforms)
* improve backward compatibility
* Remove code that is not needed in tabmat v4 / glum v3 (#741)
* Remove check_array from predict()
We don't need it here as predict calls linear_redictor, and the latter does this check. We can avoid doing it twice.
* Remove _name_categorical_variable parts
There is no need for those as Tabmat v4 handles variable names internally.
---------
Co-authored-by: Martin Stancsics <martin.stancsics@gmail.com>
* Fix formula test: consider presence of intercept in full rankness check when constructing the model matrix externally (#746)
* deal with intercept in formula test correctly
* naming [skip ci]
* test varying significance level in coef table test (#749)
* pin formulaic to 0.6 (#752)
* Add illustration of formula interface to example in README (#751)
* add illustration of formula to readme
* rephrase
* spacing
* add linear term for illustration
* Determine presence of intercept only by `fit_intercept` argument (#747)
* always use self.fit_intercept; raise if formula conflicts with it
* wording [skip ci]
* adjust other tests, cosmetics
* don't compare specs with singular matrix to smf
* fix smf test formula
* fix intercept in context test
* remove outdated sentence; clean up
* fix
* adjust tutorial
* adjust tutorial
* consistent linebreaks in docstring
* remove obsolete arg in docstring
* Informative error when encountering categories that were not seen in training (#748)
* drop missings not seen in training
* zero not drop
* better (?) name [skip ci]
* catch case of unseen missings and fail method
* fix
* respect categorical missing method with formula; test different categorical missing methods also with formula
* shorten the tests
* dont allow fitting in case of conversion of categoricals and presence of formula
* clearer error msg
* also change the error msg in the regex (facepalm)
* remove matches
* fix
* better name
* describe more restrictive behavior in tutorial
* Raise error on unseen levels when predicting
* Allow cat_missing_method='convert' again
* Update test
* Check for unseen categories
* Adapt align_df_categories tests to changes
* Make pre-commit happy
* Avoid unnecessary work
* Correctly expand penalties with categoricals and `cat_missing_method="convert"` (#753)
* Correctyl expand penalties when cat_missing_method=convert
* Add test
* Improve variable names
Co-authored-by: Matthias Schmidtblaicher <42544829+MatthiasSchmidtblaicherQC@users.noreply.github.com>
---------
Co-authored-by: Matthias Schmidtblaicher <42544829+MatthiasSchmidtblaicherQC@users.noreply.github.com>
* bump tabmat pre-release version
---------
Co-authored-by: Martin Stancsics <martin.stancsics@gmail.com>
* docstring cosmetics
* even more docstring cosmetics
* Do not fail when an estimator misses class members that are new in v3 (#757)
* do not fail on missing class members that are new in v3
* simplify
* convert
* shorten the comment
* simplify
* don't use getattr unnecessarily
* cosmetics
* fix unrelated typo
* tiny cosmetics [skip ci]
* No regularization as default (#758)
* set alpha=0 as default
* fix docstring
* add alpha where needed to avoid LinAlgError
* add changelog entry
* also set alpha in golden master
* change name in persisted file too
* set alpha in model_parameters again
* don't modify case of no alpha attribute, which is RegressorCV
* remove invalid alpha argument
* wording
* Improve code readability
* Make arguments to public methods except `X`, `y`, `sample_weight` and `offset` keyword-only and make initialization keyword-only (#764)
* make all args except X, y, sample_weight, offset keyword only; make initialization keyword only
* add changelog [skip ci]
* mention that also RegressorBase was changed [skip ci]
* fix import
* clean up changelog
* Restructure distributions (#768)
* Explain `scale_predictors` more (#778)
* Expand on effect of scale_predictors and remove note
* Update src/glum/_glm.py
Co-authored-by: Jan Tilly <jan.tilly@quantco.com>
* remove sentence
---------
Co-authored-by: Jan Tilly <jan.tilly@quantco.com>
* Move helpers into `_utils` (#782)
* Patch docstring
* Update CHANGELOG.rst
Co-authored-by: Luca Bittarello <15511539+lbittarello@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Luca Bittarello <15511539+lbittarello@users.noreply.github.com>
* shorten docstrings of private functions; typos in defaults; other suggestions
* context docstring
* kwargs
* no context as default; small cleanups
* add explanation to get calling scope
* adjust to tabmat release
* keep whitespace
* temporarily add tabmat_dev channel again to investigate env solving failure on CI
* remove tabmat_dev channel again
* for now, disable conda build test on osx and Python 3.12
* Add a different environment for macos (#786)
* try solving on ci with different env for macos
* add missing if
* typo
* try and remove --no-test flag
* replace deprecated scipy.sparse.*_matrix.A
* replace other instance of .A
* two more
* simply replace all instances of .A by .toarray() (tabmat knows both)
* update CHANGELOG for release
---------
Co-authored-by: quant-ranger[bot] <132915763+quant-ranger[bot]@users.noreply.github.com>
Co-authored-by: Jan Tilly <jan.tilly@quantco.com>
Co-authored-by: Marc-Antoine Schmidt <marc-antoine.schmidt@quantco.com>
Co-authored-by: Matthias Schmidtblaicher <matthias.schmidtblaicher@quantco.com>
Co-authored-by: Matthias Schmidtblaicher <42544829+MatthiasSchmidtblaicherQC@users.noreply.github.com>
Co-authored-by: Martin Stancsics <martin.stancsics@gmail.com>
Co-authored-by: Luca Bittarello <15511539+lbittarello@users.noreply.github.com>
Co-authored-by: lbittarello <luca.bittarello@gmail.com>1 parent 954abc6 commit 653b419
File tree
26 files changed
+4629
-1390
lines changed- .github/workflows
- conda.recipe
- docs/tutorials
- formula_interface
- src
- glum_benchmarks/data
- glum
- tests/glm
- golden_master
26 files changed
+4629
-1390
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
43 | | - | |
| 43 | + | |
| 44 | + | |
44 | 45 | | |
45 | 46 | | |
46 | 47 | | |
47 | 48 | | |
48 | 49 | | |
49 | 50 | | |
50 | 51 | | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
51 | 61 | | |
52 | 62 | | |
53 | 63 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
27 | | - | |
| 27 | + | |
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
10 | 32 | | |
11 | 33 | | |
12 | 34 | | |
| |||
16 | 38 | | |
17 | 39 | | |
18 | 40 | | |
19 | | - | |
| 41 | + | |
20 | 42 | | |
21 | 43 | | |
22 | 44 | | |
| |||
73 | 95 | | |
74 | 96 | | |
75 | 97 | | |
| 98 | + | |
76 | 99 | | |
77 | 100 | | |
78 | 101 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
68 | 68 | | |
69 | 69 | | |
70 | 70 | | |
71 | | - | |
| 71 | + | |
72 | 72 | | |
73 | 73 | | |
74 | 74 | | |
| |||
79 | 79 | | |
80 | 80 | | |
81 | 81 | | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
82 | 91 | | |
83 | 92 | | |
84 | 93 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
38 | | - | |
| 38 | + | |
| 39 | + | |
39 | 40 | | |
40 | 41 | | |
41 | 42 | | |
| |||
0 commit comments