Update CI: Test min + LTS and test macOS with native binaries#1196
Update CI: Test min + LTS and test macOS with native binaries#1196pkofod merged 3 commits intoJuliaNLSolvers:masterfrom
Conversation
|
the tests that fail are quite sensitive / unstable |
|
The failing macOS test uses the native aarch64 binaries in this PR but currently on master runs with Rosetta emulation. |
|
I can make a commit to adjust the tests accordingly |
.github/workflows/CI.yml
Outdated
| - "min" | ||
| - "lts" | ||
| - "1" | ||
| - "pre" |
There was a problem hiding this comment.
It's a bit questionable whether to always run tests on "pre" - when there's no new prerelease available (e.g. for 1.13), then tests will be identical to "1".
That would be great, I'm not familiar with that part of Optim yet. |
Benchmark Results (Julia vlts)Time benchmarks
Memory benchmarks
A plot of the benchmark results has been uploaded as an artifact at . |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #1196 +/- ##
==========================================
+ Coverage 87.59% 87.80% +0.20%
==========================================
Files 45 45
Lines 3515 3517 +2
==========================================
+ Hits 3079 3088 +9
+ Misses 436 429 -7 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
@pkofod You could release the bug fix in JuliaNLSolvers/NLSolversBase.jl#163 (I bumped the version in that PR and the changes up until that commit should be non-breaking) to see if it fixes the test failures here. |
the constraints bug? Those two failures are not using constraints though |
|
Ah, OK, I just saw that test failures were related to IPNewton. |
the failing tests here were NGMRES (in my experience quite unstable) and now it's momentum (also quite unstable). I spent some time checking the implementation and I think it is just unstable, maybe especially with a line search. It typically is used for large systems and "machine learning" so often you just have a small step size / "learning rate" or even some kind of decay structure that forces the procedure to settle down. |
If the target is a breaking release anyway, we could force users to choose a line search here. |
|
given the length of the skip list it may be time to find a better way to test this method :) |
No description provided.