
tinkering on gradient-based optimization #871

Draft
palday wants to merge 26 commits into main from db/pa/gradient

Conversation

@palday
Member

@palday palday commented Dec 29, 2025

No description provided.

@dmbates
Collaborator

dmbates commented Dec 29, 2025

Thank you for reorganizing my notes into a reasonable archive. Somehow a good git organization remains a "here be dragons" area for me.

@codecov

codecov bot commented Dec 30, 2025

Codecov Report

❌ Patch coverage is 66.53846% with 87 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.75%. Comparing base (4e04cfd) to head (ca8d1a9).

Files with missing lines Patch % Lines
src/gradient.jl 64.04% 87 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #871      +/-   ##
==========================================
- Coverage   95.62%   93.75%   -1.88%     
==========================================
  Files          38       39       +1     
  Lines        3702     3954     +252     
==========================================
+ Hits         3540     3707     +167     
- Misses        162      247      +85     
Flag Coverage Δ
current 93.45% <66.27%> (-1.85%) ⬇️
minimum 93.70% <66.53%> (-1.87%) ⬇️
nightly 93.45% <66.27%> (-1.85%) ⬇️

Flags with carried forward coverage won't be shown.



dmbates commented Jan 14, 2026

I have added some code to evaluate the diagonal block of Omega-dot for each parameter component; it is in src/gradient.jl. Some explanation of what is going on is in gradients/Gradient_evaluation.qmd. The next steps are to fill out the other blocks and to add methods for Omega_dot_diag_block! for diagonal blocks further down the diagonal. In most cases these are dense matrices, rather than Diagonal or UniformBlockDiagonal, but only the diagonal or the diagonal blocks are overwritten at this stage.

I did add tests for the methods I added today.
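To make the derivative structure concrete, here is a minimal toy sketch (not the PR's code; the matrix `A`, the scalar parameterization `Λ(θ) = θI`, and the form `Ω(θ) = Λ'AΛ + I` are illustrative assumptions) of an Omega-dot block for one parameter component, checked against a central finite difference:

```julia
using LinearAlgebra

# Toy illustration (hypothetical, not the PR's code): with a scalar
# component θ, Λ(θ) = θI, and Ω(θ) = Λ'AΛ + I, the derivative is
# Ω̇ = Λ̇'AΛ + Λ'AΛ̇ = 2θA, touching only entries within A's block.
A = [4.0 1.0; 1.0 3.0]             # stand-in for one Z'Z diagonal block
Ω(θ) = θ^2 * A + I                 # Λ(θ)'AΛ(θ) + I with Λ(θ) = θI
Ωdot(θ) = 2θ * A                   # analytic derivative of Ω w.r.t. θ

θ = 0.5
h = 1e-6
fd = (Ω(θ + h) - Ω(θ - h)) / (2h)  # central finite difference
@assert isapprox(fd, Ωdot(θ); atol = 1e-6)
```

The same finite-difference comparison is the natural unit test for each `Omega_dot_diag_block!`-style method: the analytic block must agree with a numerical derivative of the assembled Ω.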


dmbates commented Jan 14, 2026

The failure on nightly is from an old test of the HTML output, which apparently is not formatted the same way as in earlier versions. I'm not sure why Style Enforcer is failing.


dmbates commented Jan 24, 2026

I added some preliminary code to evaluate the gradient using the blocked form of the derivative of $\Omega$ and the blocked L matrix. It is not polished, but it works, sort of. I just updated the gradients/GradientEvaluation.qmd document with some examples using the blocked evaluation of $L^{-1}\dot{\Omega} L^{-T}$ and the full-matrix evaluation; both are compared to finite-difference approximations. Somehow the blocked evaluation of $L^{-1}\dot{\Omega} L^{-T}$ gives a different value for the second component of the gradient in the last example (penicillin), but I can't figure out why: the numbers used to evaluate this quantity appear to be the same for the blocked and the full-matrix evaluations. @palday, if you can shed any light on this, I would appreciate it.
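The identity behind both evaluation paths can be checked on a toy example (illustrative matrices only, not the penicillin model): for $\Omega = LL^{T}$, $\operatorname{tr}(L^{-1}\dot{\Omega} L^{-T}) = \operatorname{tr}(\Omega^{-1}\dot{\Omega}) = \frac{d}{d\theta}\log\det\Omega$, so the blocked and full-matrix evaluations must agree with each other and with a finite difference of the log-determinant:

```julia
using LinearAlgebra

# For Ω = LL', tr(L⁻¹ Ω̇ L⁻ᵀ) = tr(Ω⁻¹ Ω̇) = d(logdet Ω)/dθ.
A = [4.0 1.0; 1.0 3.0]             # illustrative SPD stand-in
Ω(θ) = θ^2 * A + I
Ωdot(θ) = 2θ * A

θ = 0.7
L = cholesky(Symmetric(Ω(θ))).L
blocked = tr((L \ Ωdot(θ)) / L')   # tr(L⁻¹ Ω̇ L⁻ᵀ), the blocked path
full    = tr(Ω(θ) \ Ωdot(θ))       # tr(Ω⁻¹ Ω̇), the full-matrix path
h = 1e-6
fd = (logdet(Ω(θ + h)) - logdet(Ω(θ - h))) / (2h)
@assert isapprox(blocked, full; rtol = 1e-10)
@assert isapprox(blocked, fd; atol = 1e-6)
```

When the blocked path disagrees with the full-matrix path on one gradient component, as reported above, a check of this shape localizes which trace term diverges.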


dmbates commented Jan 25, 2026

I think I know what the problem is: I didn't zero out all the blocks that I should have before the blocked gradient evaluation. It should be fixed today.
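This failure mode is easy to reproduce in miniature: the five-argument `mul!` with β = 1 accumulates into whatever the destination already holds, so reusing block storage across parameter components without zeroing it carries the previous component's values forward. A hypothetical sketch (not the PR's code):

```julia
using LinearAlgebra

A = [1.0 2.0; 3.0 4.0]
buf = zeros(2, 2)

mul!(buf, A, A, 1.0, 1.0)   # component 1: buf = A*A (buf started at zero)
mul!(buf, A, A, 1.0, 1.0)   # component 2 without zeroing: buf = 2*A*A (stale)
@assert buf == 2 * (A * A)  # the previous component's result leaked in

fill!(buf, 0.0)             # zero the block before each component
mul!(buf, A, A, 1.0, 1.0)
@assert buf == A * A        # correct
```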


dmbates commented Jan 25, 2026

I think the initialization of the blocked form for evaluation of the gradient is now fixed. I must admit that I am uniquely skilled at confusing myself about the order in which to do such calculations, but I think I have it now.

This is still a WIP for the actual gradient evaluation. @palday, if you have time, I may ask for your advice on how to structure the code, which is kind of all over the place right now.

I will run it through the formatter and commit that version, to try to cut down on error messages from commits.


dmbates commented Jan 26, 2026

The development of the gradient evaluation is currently in src/gradient.jl.

My next tasks will be to fold the initialize_blocks! function into eval_grad_p! and then fold that into a function, perhaps called gradient!, that modifies the elements of a vector passed as the first argument. (Is gradient! too general a term? Should it be lmm_gradient! or something like that?)

I am still keeping the storage used in the gradient evaluation, produced with grad_blocks, outside the LinearMixedModel struct. It can be moved there, but I haven't thought through how to do that, because we don't want to allocate this storage unless we are going to use it. I am thinking of an optional Boolean argument named gradient that controls whether to allow for evaluation of a gradient; if true, it would allocate the grad blocks in the LinearMixedModel struct. This argument would also affect the default choice of optimizer.

After that, a lot of testing and timing. I fear that, when all is said and done, gains in evaluation time will depend strongly on the type(s) and size(s) of the grouping factors in the model. Perhaps we will see a gain in reliability.
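One possible shape for that interface, sketched on a toy struct (the names `ToyModel`, `gradient!`, `gradblocks`, and the `gradient` keyword are all hypothetical placeholders for the design being discussed, not the PR's code): allocate the gradient storage only when requested at construction, and have `gradient!` fail loudly when it is absent.

```julia
# Hypothetical sketch of the interface under discussion; none of these
# names are from the PR itself.
mutable struct ToyModel
    θ::Vector{Float64}
    gradblocks::Union{Nothing, Vector{Matrix{Float64}}}
end

# allocate gradient storage only when `gradient=true` is requested
ToyModel(θ::Vector{Float64}; gradient::Bool = false) =
    ToyModel(θ, gradient ? [zeros(2, 2) for _ in θ] : nothing)

function gradient!(g::AbstractVector, m::ToyModel)
    m.gradblocks === nothing &&
        throw(ArgumentError("model was not constructed with gradient=true"))
    for (j, blk) in pairs(m.gradblocks)
        fill!(blk, 0)     # zero per-component storage before each use
        # ... fill blk with the blocks of L⁻¹Ω̇ⱼL⁻ᵀ and reduce ...
        g[j] = sum(blk)   # placeholder reduction for the sketch
    end
    return g
end

m = ToyModel([0.5, 0.25]; gradient = true)
g = gradient!(zeros(2), m)
```

The same flag can then steer the default optimizer choice: gradient storage present selects a gradient-based method, absent falls back to the current derivative-free one.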
