Refactor SCF quantites and convergence by kubanmar · Pull Request #260 · FAIRmat-NFDI/nomad-simulations

kubanmar · 2025-09-04T15:45:46Z

In this PR I want to refactor how we deal with non-converged or otherwise unfinished calculations.

The first step is to refactor and unify how we represent the convergence of the simulation workflow. This PR introduces a new subsection WorkflowConvergenceTarget to all cases where a workflow converges, e.g., SCF, or geometry optimization. It is intended to replace the convergence parameters in GeometryOptimizationModel as well as the convergence settings in SCFOutputs and SelfConsistency.

SCFOutputs is very minimal now and I don't know what it is used for. The information about the SCF should be included in the workflow section. Should SCFOutputs be a reference then, to not duplicate information? Is SCFOutputs used anywhere?

This is a draft PR. I left the failing tests in on purpose to keep track of the functionality that I broke by removing SelfConsistency.

Note: This PR breaks the abinit parser, because it tries to populate workflow.geometry_optimization.GeometryOptimizationModel.convergence_tolerance_energy_difference, which has moved to WorkflowConvergenceTarget.

Summary of changes:

add more quantities to Program
add WorkflowConvergenceTarget
add WorkflowConvergenceTarget to SimulationWorkflowModel
remove SelfConsistency
remove self_consistency_ref from PhysicalProperty
remove convergence from SCFOutputs
remove convergence from GeometryOptimizationModel

JFRudzinski

This looks good to me.

TODOs (ideally in the next 2 weeks, i.e., by 27.10):

Add SCF as a workflow class, like GeomOpt
Determine a standard for populating SCF and SCF+GeomOpt, ideally using a representative parser
Align with @ndaelman in the context of #191 and decide what indicators will be present in data.outputs
Consider backward compatibility / integration procedure in the parsers

coveralls · 2025-10-17T13:42:01Z

Pull Request Test Coverage Report for Build 18594497000

Details

21 of 21 (100.0%) changed or added relevant lines in 6 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage decreased (-0.01%) to 90.259%

Totals
Change from base Build 18342061192:	-0.01%
Covered Lines:	5958
Relevant Lines:	6601

💛 - Coveralls

kubanmar · 2025-11-14T16:46:58Z

The current status of the implementation is:

the schema implements convergence targets for geometry optimization and single point workflows (individually)
implemented in exciting parser (https://github.com/FAIRmat-NFDI/nomad-parser-plugins-simulation/tree/schema_update_convergence_targets)
TODO see where SCF iterations data goes
TODO resolve conflicts

JFRudzinski · 2025-11-15T06:56:31Z

That's great @kubanmar, thanks for pushing it forward 🙌 let's see if there is any time next week, but if not it can be wrapped up the following week

ndaelman-hu · 2025-11-25T09:54:51Z

src/nomad_simulations/schema_packages/workflow/general.py

+    convergence_threshold_unit = Quantity(
+        type=str,
+        description="""
+        Unit using the pint UnitRegistry() notation for the `convergence_threshold`.
+        """
+    )


Note that this won't be compatible with the NOMAD framework for handling units: the value won't be converted when the user requests a different unit. Moreover, to fit your description, you should alter the type. Even then, it still won't power unit conversion on request.

Modular handling of units is very hard. Many people have given it a shot. The closest we have right now are these dataframes.

kubanmar · 2025-12-12T17:11:47Z

Newest updates:

correct behavior for SinglePoint workflows
new challenge when populating the workflow section of the archive directly: mapping of tasks becomes nested

TODO:

find a way to pass information of the convergence of the single points in a geometry optimization to the top level workflow

I have left some comments in the code where I could use some help.
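One possible shape for the TODO above, as a minimal sketch only: the parent workflow collects the convergence flags of its child tasks into a summary. `TaskSummary` and `summarize` are hypothetical names invented for this illustration and are not part of nomad-simulations or NOMAD.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TaskSummary:
    """Hypothetical stand-in for a workflow task (not a nomad-simulations class)."""

    name: str
    is_converged: Optional[bool] = None  # None means "not reported"
    children: List['TaskSummary'] = field(default_factory=list)


def summarize(task):
    """Collect child convergence so the parent can report it in one place."""
    return {
        'is_converged': task.is_converged,
        'children_converged': sum(c.is_converged is True for c in task.children),
        'children_unconverged': [
            c.name for c in task.children if c.is_converged is False
        ],
        'children_unknown': [c.name for c in task.children if c.is_converged is None],
    }


# A geometry optimization that converged overall, with one unconverged
# single point and one single point that reported nothing.
geo_opt = TaskSummary(
    'geometry_optimization',
    True,
    [
        TaskSummary('single_point_0', True),
        TaskSummary('single_point_1', False),
        TaskSummary('single_point_2'),
    ],
)
summary = summarize(geo_opt)
```

Keeping "not reported" (`None`) separate from "not converged" (`False`) is an assumption of this sketch; it avoids counting missing SCF information as a failure.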

JFRudzinski · 2025-12-16T13:59:40Z

@kubanmar can you rename this PR so the scope is more clear, include SCF

kubanmar · 2025-12-19T16:18:59Z

The latest commit provides a working example:

convergence targets are correctly recognized and during normalization the corresponding results are created
correct assignment of single point vs. geometry optimization convergence targets
independent reporting of geometry optimization and SCF convergence

TODO:

resolve conflicts
tests

@EBB2675 if you want to review/contribute I would be happy, otherwise I will continue working on it after the break

coveralls · 2026-01-28T16:16:56Z

Pull Request Test Coverage Report for Build 22540757767

Details

487 of 547 (89.03%) changed or added relevant lines in 12 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.2%) to 83.383%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
tests/conftest.py	5	9	55.56%
src/nomad_simulations/schema_packages/workflow/geometry_optimization.py	43	62	69.35%
src/nomad_simulations/schema_packages/workflow/general.py	135	172	78.49%

Totals
Change from base Build 22018524490:	0.2%
Covered Lines:	7843
Relevant Lines:	9406

💛 - Coveralls

kubanmar · 2026-01-28T16:37:30Z

I have now merged develop and fixed formatting and type checks. There are still plenty of TODOs for which I would appreciate help.

The most severe issue is that this is only compatible with NOMAD v1.3, as the newer version generates too many nested subsections for the workflow tasks. If I understood @EBB2675 correctly, this is not a schema/parser-level issue, but can only be addressed at the mapping-parser level.

Using https://github.com/FAIRmat-NFDI/nomad-parser-plugins-simulation/tree/schema_update_convergence_targets for the parsers, the code now generates the intended data for: exciting single point calculations and geometry optimizations.

From my side, this is therefore ready for review.

kubanmar · 2026-01-30T13:34:40Z

TODOs:

should one replace the repeating subsection of ConvergenceTargets with a section definition that explicitly defines all targets as quantities (see top of workflow/general)? It would be nice if everyone could comment on this
there are no tests; at least for the normalization they would be useful
TODOs in the code

ndaelman-hu · 2026-02-26T10:31:27Z

residuum is semantically inconsistent with other threshold_type values. The other four threshold_type values (absolute, relative, maximum, rms) are mathematical comparison methods that describe how to perform the convergence check - pure mathematical operations independent of the physical quantity being checked.

residuum is different: it specifies what to compare - the difference between the current value and the value estimated from the wavefunction, rather than the difference between successive iterations (general.py:54). This makes residuum quantity-specific (only applicable to density/wavefunction), while the other four are quantity-agnostic.

Additionally, residuum currently has no supporting implementation logic.

I recommend removing residuum from threshold_type. When codes report wavefunction/density residuals, create specialized convergence target classes:

class WavefunctionResidualConvergenceTarget(WorkflowConvergenceTarget):
    threshold = Quantity(type=np.float64, unit='coulomb')
    threshold_type = Quantity(...)  # Can still use absolute/rms/etc for the residual
    _convergence_property_path = 'scf_steps.wavefunction_residuals'

This maintains the clean semantic separation: class = physical quantity, threshold_type = mathematical comparison method.

ndaelman-hu · 2026-02-26T10:45:36Z

src/nomad_simulations/schema_packages/workflow/general.py

+            if isinstance(value, int | float | np.floating):
+                # Scalar value - use absolute or relative
+                if conv_type == 'absolute':
+                    return self._check_absolute(value)
+                elif conv_type == 'relative':
+                    # For relative, child class should provide reference
+                    logger.warning(
+                        f'Relative convergence requires reference value in '
+                        f'{self.__class__.__name__}'
+                    )
+                    return None
+                else:
+                    return self._check_absolute(value)
+
+            elif isinstance(value, np.ndarray):
+                # Array value - can use maximum or rms
+                if conv_type == 'maximum':
+                    return self._check_maximum(value)
+                elif conv_type == 'rms':
+                    return self._check_rms(value)
+                elif conv_type == 'absolute':
+                    # For array, treat as maximum
+                    return self._check_maximum(value)
+                else:
+                    return self._check_maximum(value)


Issue: The `threshold_type` dispatch is coupled to data shape, creating artificial constraints where data shape dictates which comparison methods are valid rather than user intent or physical meaning. Why can't you check `absolute` on each array element, or `maximum` of a scalar?

Additionally, there's silent fallback behavior with no warnings when `threshold_type` doesn't match data shape:

Parser sets `threshold_type='maximum'` but data is scalar → silently uses `absolute` instead (line 219)

Parser sets `threshold_type='absolute'` but data is array → silently uses `maximum` instead (line 229)

The `threshold_type` should describe the mathematical operation independent of data shape.
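To make the point concrete, here is a minimal sketch of a shape-independent check, assuming plain Python numbers or flat sequences rather than the `Quantity` values used in `general.py`. The helper names `as_magnitudes` and `check` are hypothetical and do not exist in the PR.

```python
import math
from collections.abc import Iterable


def as_magnitudes(value):
    """Return absolute values of a scalar or a flat iterable as a list."""
    if isinstance(value, Iterable):
        return [abs(v) for v in value]
    return [abs(value)]


def check(value, threshold, threshold_type):
    """Apply threshold_type identically to scalars and arrays."""
    mags = as_magnitudes(value)
    if threshold_type == 'absolute':
        # Every element must be below the threshold.
        return all(m < threshold for m in mags)
    if threshold_type == 'maximum':
        return max(mags) < threshold
    if threshold_type == 'rms':
        return math.sqrt(sum(m * m for m in mags) / len(mags)) < threshold
    # No silent fallback: unknown types are reported to the caller.
    raise ValueError(f'unsupported threshold_type: {threshold_type!r}')
```

In this sketch a scalar is treated as a one-element array, so `maximum` of a scalar is well defined, and an unknown `threshold_type` raises instead of falling back. Note that an element-wise `absolute` check gives the same answer as `maximum`, which is one more sign that the enum values overlap.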

After reflecting more on it: The threshold_type enum conflates three orthogonal concerns:

Data shape: scalar vs array

Aggregation method: How to reduce vectors to comparable scalars (maximum, rms)

Comparison type: How to compare values (absolute, relative)

These should be independent. For example:

You might want maximum aggregation of force components with relative convergence (current max force compared to previous max force)

Or rms aggregation of energies across a trajectory with absolute convergence

The current design forces specific pairings that may not match the physical quantity or convergence criteria desired.

A cleaner design might separate:

aggregation_method = MEnum('scalar', 'maximum', 'rms')  # How to reduce arrays
comparison_type = MEnum('absolute', 'relative')  # How to compare
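A rough sketch of how the two independent settings could combine, assuming plain floats and that convergence is judged on the change of the aggregated value between two successive iterations. The functions `aggregate` and `is_converged` are hypothetical illustrations, not code from the PR.

```python
import math


def aggregate(values, method):
    """Reduce a non-empty sequence of numbers to one non-negative scalar."""
    if method == 'scalar':
        (only,) = values  # exactly one value is expected
        return abs(only)
    if method == 'maximum':
        return max(abs(v) for v in values)
    if method == 'rms':
        return math.sqrt(sum(v * v for v in values) / len(values))
    raise ValueError(f'unknown aggregation_method: {method!r}')


def is_converged(current, previous, threshold, aggregation_method, comparison_type):
    """Compare aggregated values of two successive iterations."""
    cur = aggregate(current, aggregation_method)
    prev = aggregate(previous, aggregation_method)
    delta = abs(cur - prev)
    if comparison_type == 'absolute':
        return delta < threshold
    if comparison_type == 'relative':
        if prev == 0:
            return None  # ratio undefined; the caller decides what to do
        return delta / prev < threshold
    raise ValueError(f'unknown comparison_type: {comparison_type!r}')


# Maximum aggregation of force components with relative convergence.
force_ok = is_converged([0.010, -0.020], [0.011, -0.0201], 0.01, 'maximum', 'relative')
```

Any aggregation can be paired with any comparison here, which is the pairing freedom the enum split is meant to provide.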

ndaelman-hu · 2026-02-26T13:31:40Z

Question about workflow2.results structure

I noticed that archive.workflow2.results follows a nested subsection-with-quantities structure rather than the repeating section pattern used in archive.data.

Section-based structure (archive.data):

archive.data
  ├── model_system[] (repeating sections)
  ├── model_method[] (repeating sections)
  └── outputs[] (repeating sections)

Workflow structure (archive.workflow2):

archive.workflow2
  ├── method (subsection)
  ├── results (subsection with quantities)
  │     ├── is_converged (quantity)
  │     ├── final_energy_difference (quantity)
  │     ├── final_force_maximum (quantity)
  │     └── convergence[] (repeating subsection)
  └── tasks[] (repeating sections)

Question: Is the quantity structure `workflow2.results` meant to keep shadowing `archive.results`?

Understanding this would help clarify the design intent and whether there are legacy patterns that should be considered for future schema evolution.

@JFRudzinski

Change threshold type in all convergence target classes from `np.float64` to `positive_float()` to enforce schema-level validation. This ensures thresholds are non-negative (x ≥ 0), allowing zero thresholds but rejecting semantically invalid negative values.

Updated classes:

- `EnergyConvergenceTarget`
- `ForceConvergenceTarget`
- `PotentialConvergenceTarget`
- `ChargeConvergenceTarget`

Add test to verify zero thresholds are accepted. Remove test for negative thresholds as these are now prevented at the schema level.
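The rule stated in this commit message (x ≥ 0, zero allowed, negatives rejected) can be sketched as a standalone function. This is not the `positive_float()` helper itself; `validate_threshold` is a hypothetical name, and rejecting NaN is an extra assumption of the sketch.

```python
import math


def validate_threshold(value):
    """Accept thresholds with value >= 0; reject negatives and NaN."""
    value = float(value)
    if math.isnan(value) or value < 0:
        raise ValueError(f'threshold must be >= 0, got {value!r}')
    return value


zero_ok = validate_threshold(0.0)  # zero thresholds are accepted
small_ok = validate_threshold(1e-6)
```

Enforcing this when the value is set, rather than during normalization, is what makes a separate test for negative thresholds unnecessary.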

ladinesa · 2026-02-26T14:13:32Z

I do not mean it to shadow archive.results. In the same spirit that archive.results contains a 'summary' of the data, workflow.results is a summary of the workflow. I would like to think that there are quantities that are specific to workflow.results. I concede that some quantities seem to be duplicated, e.g., for the geometry_optimization workflow, tolerances appear in both. But in my opinion this should not be the case. For me this is limited by the requirement that search indices and the GUI are built from archive.results. This already came up in the discussion of the new results normalizer: do we indeed rely on archive.results for everything, or can we work directly with the data/workflow sections?

ndaelman-hu · 2026-02-26T14:22:12Z

Thx for clarifying! These overlapping quantities are why I asked about the shadowing in the first place.

I see. I raised this question since all other SCF data is now handled by sections. This is the only exception, and it is relevant to plotting. I was thinking of turning it into sections as well, for consistency and polymorphism. I had not yet considered how it would impact query performance (or possibility).

JFRudzinski · 2026-02-26T15:19:09Z

yes, 100% agree with @ladinesa ... there are some other developments of the results section and normalization that was met at the end of the day yesterday, but I will report at our next meeting

- Add `WavefunctionConvergenceTarget` class to `general.py` for tracking wavefunction coefficient convergence in SCF workflows
- Add `delta_wavefunction_rms` quantity to `SCFSteps` schema in `outputs.py` to store RMS changes of wavefunction coefficients
- Add comprehensive test suite covering edge cases: zero convergence, missing data, single iteration, NaN/Inf handling, negative values, boundary conditions, and array vs scalar data
- Use `positive_float()` for threshold validation (x ≥ 0)
- Set `_convergence_property_path` to `scf_steps.delta_wavefunction_rms` for automatic resolution

All 51 convergence target tests pass.

- Remove `residuum` from `threshold_type` enum (semantically inconsistent - describes WHAT not HOW)
- Remove unused imports (`Iterable`, `jmespath`, `SimulationTime`, `Outputs`)
- Simplify `WorkflowConvergenceTarget` docstring

ndaelman-hu · 2026-03-01T18:57:46Z

Resolved in commit 705b7d9.

Removed residuum from the threshold_type enum and created WavefunctionConvergenceTarget class for wavefunction convergence tracking (when parsers extract this data).

ndaelman-hu · 2026-03-01T20:50:12Z

Created follow-up issues from review discussion:

#340: Separate aggregation and comparison methods in threshold_type
#341: Design: workflow2.results structure and relationship to archive.results

ndaelman-hu · 2026-03-03T09:48:04Z

Relative Convergence and Dimensionless Thresholds

@JFRudzinski - I've been reviewing the property-specific polymorphic approach you implemented for convergence targets and noticed a semantic mismatch with relative convergence.

The relative convergence formula |value| / |reference| < threshold produces a dimensionless ratio. However, the current schema enforces dimensional thresholds on property-specific classes. For example, EnergyConvergenceTarget has threshold with unit='joule', but for relative convergence the threshold should be dimensionless (e.g., 0.001 for 0.1% convergence).
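A small numeric illustration of why the ratio carries no unit: the same energy change expressed in eV and in joule yields the same relative value, so a threshold such as 0.001 cannot meaningfully be stored with `unit='joule'`. The numbers and the helper `relative_change` are made up for this sketch.

```python
import math

JOULE_PER_EV = 1.602176634e-19  # exact SI value of one electronvolt in joule


def relative_change(value, reference):
    """Return |value| / |reference|, a dimensionless ratio."""
    if reference == 0:
        raise ZeroDivisionError('relative change is undefined for reference == 0')
    return abs(value) / abs(reference)


# Illustrative numbers: an energy change and a reference energy, both in eV.
delta_e_ev, e_ref_ev = 2.0e-4, 0.5

ratio_ev = relative_change(delta_e_ev, e_ref_ev)
ratio_joule = relative_change(delta_e_ev * JOULE_PER_EV, e_ref_ev * JOULE_PER_EV)

# The conversion factor cancels, so both ratios agree up to rounding.
same_ratio = math.isclose(ratio_ev, ratio_joule, rel_tol=1e-12)
converged = ratio_ev < 1e-3  # dimensionless threshold, i.e. 0.1%
```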

Current State:

The implementation is incomplete and misleading:

The _check_relative() helper method incorrectly works around the unit mismatch by extracting magnitude values to compare the dimensionless ratio with the dimensional threshold.
However, normalize() (lines 231-237) blocks relative convergence entirely - it logs a warning and returns None without performing any check.
Tests pass because they call _check_relative() directly, bypassing normalize(). The test annotation at line 517 states "threshold is dimensionless but stored in base unit", documenting the workaround as if it were intentional design.
The abinit parser sets threshold_type='relative' but doesn't actually receive convergence checking - it silently gets None with a warning in the logs.

I see three options to resolve this:

Option A: Split out relative convergence as a separate class with RelativeConvergenceTarget(property_name, threshold: dimensionless). This enforces correctness at the schema level but creates semantic distinction at both the class and quantity level.

Option B: Disable unit checking on threshold and validate units at runtime in normalize(). This keeps the current class structure but moves schema constraints into imperative code.

Option C: Remove relative convergence support entirely. While this is simplest in the short term and avoids polymorphism explosion, it becomes problematic if we need relative convergence in the future (as the abinit parser already does). The current system provides no clear path forward for adding this functionality back without encountering the same unit mismatch issue.
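For discussion, Option A could look roughly like the following plain-Python sketch. It uses a dataclass instead of a NOMAD section, so it only illustrates the interface named above (`property_name` plus a dimensionless `threshold`); the method `is_converged` and the choice to return `None` for a zero reference are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class RelativeConvergenceTarget:
    """Hypothetical Option A target: the threshold is a pure number."""

    property_name: str
    threshold: float  # dimensionless, e.g. 0.001 for 0.1%

    def __post_init__(self):
        if self.threshold < 0:
            raise ValueError('threshold must be >= 0')

    def is_converged(self, value: float, reference: float) -> Optional[bool]:
        """Return None when the ratio is undefined (reference == 0)."""
        if reference == 0:
            return None
        return abs(value) / abs(reference) < self.threshold


target = RelativeConvergenceTarget('energy', 1e-3)
result = target.is_converged(value=2.0e-4, reference=0.5)
```

Because `value` and `reference` only enter as a ratio, they may be given in any unit as long as both use the same one.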

Which approach aligns better with the overall schema design philosophy?

ndaelman-hu · 2026-03-03T09:50:06Z

RMS Semantics Disambiguation

@mkuban - I wanted to double-check the intended semantics of RMS convergence.

The threshold_type description (line 55) states that RMS "provides a statistical measure of overall convergence for multi-component properties" and operates "across all components". The phrasing suggests a statistical interpretation, but "components" could refer to vector components (Fx, Fy, Fz) or to a collection of entities.

The current implementation for forces first computes vector norms per atom, then applies RMS to that list of atomic force norms. This is consistent with a statistical aggregation interpretation (L2 norm over the collection, similar to how maximum uses L∞ norm).

Could you confirm that this two-level approach is the intended semantics for RMS?
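The two readings of "components" give different numbers, which a short sketch makes visible. It assumes forces are given as one (Fx, Fy, Fz) tuple per atom in arbitrary units; the function names are hypothetical.

```python
import math


def atomic_force_norms(forces):
    """Level 1: Euclidean norm of each atom's force vector."""
    return [math.sqrt(fx * fx + fy * fy + fz * fz) for fx, fy, fz in forces]


def rms(values):
    """Root mean square of a non-empty sequence of numbers."""
    return math.sqrt(sum(v * v for v in values) / len(values))


forces = [(3.0, 4.0, 0.0), (0.0, 0.0, 0.0)]  # two atoms, arbitrary units

norms = atomic_force_norms(forces)

# Level 2, reading A: RMS over the per-atom norms (the two-level approach).
rms_over_atoms = rms(norms)

# Reading B: RMS over all 3N Cartesian components, with no per-atom step.
rms_over_components = rms([c for atom in forces for c in atom])

# Maximum over the per-atom norms, for comparison.
max_over_atoms = max(norms)
```

The two RMS values differ by a factor of sqrt(3) for any input, since both sum the same squares but divide by N and 3N respectively, so the description should state which one is meant.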

kubanmar self-assigned this Sep 4, 2025

JFRudzinski reviewed Oct 10, 2025

JFRudzinski mentioned this pull request Nov 5, 2025

Where to put (SCF) Convergence #207

Open

ndaelman-hu reviewed Nov 25, 2025

kubanmar requested review from Bernadette-Mohr, EBB2675 and ladinesa December 12, 2025 17:12

JFRudzinski mentioned this pull request Dec 16, 2025

Restructure SCF #191

Closed

kubanmar changed the title ~~Warnings and errors~~ Refactor SCF quantites and convergence Dec 16, 2025

kubanmar marked this pull request as ready for review January 28, 2026 16:39

kubanmar requested review from JFRudzinski and ndaelman-hu January 28, 2026 16:39

kubanmar added 9 commits February 17, 2026 13:47

draft workflow convergence subsection

e99f712

add details for simulation program

bab6807

refactor representation of convergence

caca669

remove SCFOutputs

f02a9a2

add scf loop quantities to output

028e5bf

WIP workflow convergence target

50ee380

convergence for geo opt and single point

0af5f3c

WIP implementation of convergence normalization

abe310e

bugfixes

0b5342c

JFRudzinski force-pushed the warnings_and_errors branch from 9aadb92 to 88a7159 Compare February 17, 2026 12:52

JFRudzinski added 3 commits February 17, 2026 14:00

rebase fixes

d2eaf24

update Method usage

ce07b0c

update MD

c20742d

This was referenced Feb 18, 2026

Schema update convergence targets FAIRmat-NFDI/nomad-parser-plugins-simulation#150

Draft

Scf convergence migration FAIRmat-NFDI/nomad-parser-plugins-simulation#151

Draft

JFRudzinski requested review from ndaelman-hu and removed request for Bernadette-Mohr, EBB2675, JFRudzinski, ladinesa and ndaelman-hu February 22, 2026 20:54

ndaelman-hu reviewed Feb 26, 2026

JFRudzinski approved these changes Feb 27, 2026

ndaelman-hu added 2 commits February 27, 2026 23:28

Clean up WorkflowConvergenceTarget enum and imports

705b7d9

This was referenced Mar 1, 2026

Separate aggregation and comparison methods in threshold_type #340

Open

Design: workflow2.results structure and relationship to archive.results #341

Open
