Commit 8de9a81

Omol models (#120)

Authored by ben rhodes (benrhodes26)

* Add dispersion models and update internal check script
* Update model with charge+spin conditioning. Add model stubs. Still need to add weights
* Add paths. Handle charge and spin in atoms.info
* Final working implementation
* Update readmes
* Address comments
* Address comments

Co-authored-by: ben rhodes <benrhodes@bens-MacBook-Pro.local>

1 parent 25185e8

File tree

14 files changed: +900, -294 lines

MODELS.md

Lines changed: 51 additions & 58 deletions
@@ -1,59 +1,52 @@
-## Released Models
-
-
-| Model Name | Link | MD5 Hash | Matbench Discovery | D3 Corrections | Notes |
-|------------|------|----------|---------------------|-----------------|-------|
-| **orb-v1** | [link](https://orbitalmaterials-public-models.s3.us-west-1.amazonaws.com/forcefields/orbff-v1-20240827.ckpt) | 92897eda08609425ee001955c7885139 | Yes | No | Full dataset pretraining, MPTraj + Alexandria finetuning |
-| **orb-mptraj-only-v1** | [link](https://orbitalmaterials-public-models.s3.us-west-1.amazonaws.com/forcefields/orbff-mptraj-only-v1-20240827.ckpt) | ff42a2bc1e1f50b5f3ee2a20b83cf3a2 | Yes | No | MPTraj pretraining and finetuning only |
-| **orb-d3-v1** | [link](https://orbitalmaterials-public-models.s3.us-west-1.amazonaws.com/forcefields/orb-d3-v1-20240902.ckpt) | 470c7d3482ead3bc97cd4b46382d5e47 | No | Yes | Full dataset pretraining, MPTraj + Alexandria finetuning, integrated D3 corrections |
-| **orb-d3-sm-v1** | [link](https://orbitalmaterials-public-models.s3.us-west-1.amazonaws.com/forcefields/orb-d3-sm-v1-20240902.ckpt) | 64fe91603e46ad5fa695525e3f1e9397 | No | Yes | First 10 layers of a pretrained model finetuned on mptrj + alexandria with D3 corrections |
-| **orb-d3-xs-v1** | [link](https://orbitalmaterials-public-models.s3.us-west-1.amazonaws.com/forcefields/orb-d3-xs-v1-20240902.ckpt) | 79d042f9f16c4407795426a75498fbb7 | No | Yes | First 5 layers of a pretrained model finetuned on mptrj + alexandria with D3 corrections |
-
-
-
-### Matbench Discovery Results
-
-
-### orb-v1: Full dataset pretraining, MPtraj + Alexandria finetuning
-
-```
-                    orb          10k       unique
-F1             0.846577     0.988213     0.867282
-DAF            5.394101     6.389021     6.015771
-Precision      0.898971     0.976700     0.919641
-Recall         0.799953     1.000000     0.820563
-Accuracy       0.951678     0.976700     0.961608
-TPR            0.799953     1.000000     0.820563
-FPR            0.017979     1.000000     0.012939
-TNR            0.982021     0.000000     0.987061
-FNR            0.200047     0.000000     0.179437
-TP         34258.000000  9767.000000 27031.000000
-FP          3850.000000   233.000000  2362.000000
-TN        210288.000000     0.000000 180184.000000
-FN          8567.000000     0.000000  5911.000000
-MAE            0.030884     0.019012     0.030589
-RMSE           0.080986     0.064470     0.079003
-R2             0.798803     0.907903     0.815941
-```
-### orb-mptraj-only-v1: MPTraj pretraining, MPTraj finetuning
-
-```
-                    orb          10k       unique
-F1             0.752143     0.963193     0.761336
-DAF            4.267540     6.076994     4.667345
-Precision      0.711221     0.929000     0.713505
-Recall         0.798062     1.000000     0.816040
-Accuracy       0.912341     0.929000     0.921787
-TPR            0.798062     1.000000     0.816040
-FPR            0.064804     1.000000     0.059130
-TNR            0.935196     0.000000     0.940870
-FNR            0.201938     0.000000     0.183960
-TP         34177.000000  9290.000000 26882.000000
-FP         13877.000000   710.000000 10794.000000
-TN        200261.000000     0.000000 171752.000000
-FN          8648.000000     0.000000  6060.000000
-MAE            0.044745     0.040998     0.046230
-RMSE           0.093426     0.102950     0.093919
-R2             0.732243     0.780546     0.739879
-```
+## Pretrained models
 
+We provide several pretrained models that can be used to calculate energies, forces & stresses of atomic systems. All models are provided in the `orb_models.forcefield.pretrained` module.
+
+### OrbMol Models
+
+These models are a continuation of the `orb-v3` series, but are trained on the [Open Molecules 2025 (OMol25)](https://arxiv.org/pdf/2505.08762) dataset: over 100M high-accuracy DFT calculations (ωB97M-V/def2-TZVPD) on diverse molecular systems, including metal complexes, biomolecules, and electrolytes.
+
+There are two options:
+* `orb-v3-conservative-omol`
+* `orb-v3-direct-omol`
+
+See below for more explanation of this naming convention. Both models have `inf` neighbors, ensuring a continuous PES.
+
+### [V3 Models](https://arxiv.org/abs/2504.06231)
+V3 models use the following naming convention: ```orb-v3-X-Y-Z``` where:
+- `X`: Model type, `direct` or `conservative`. Conservative models compute forces and stress via backpropagation, a physically motivated choice that appears necessary for certain types of simulation such as NVE molecular dynamics. Conservative models are significantly slower and use more memory than their direct counterparts.
+
+- `Y`: Maximum neighbors per atom, `20` or `inf`. A finite cutoff of `20` induces discontinuities in the PES, which can lead to significant inaccuracies for certain types of highly sensitive calculations (e.g. calculations involving Hessians). However, finite cutoffs reduce the amount of edge processing in the network, reducing latency and memory use.
+
+- `Z`: Training dataset, `omat` or `mpa`. Both of these datasets consist of small bulk crystal structures. We find that models trained on such data can generalise reasonably well to non-periodic systems (organic molecules) or partially periodic systems (slabs), but caution is advised in these scenarios.
+
+#### Features
+- Model compilation using PyTorch 2.6.0+, enabling faster inference while maintaining support for dynamic graph sizes
+- Wider architecture (1024 vs 512) with fewer layers (5 vs 15) compared to v2, resulting in 2-3x faster performance with a similar parameter count
+- Two variants available: direct models and conservative models (forces/stress computed via backpropagation)
+- Trained on the larger, more diverse OMat24 dataset
+- Improved edge embeddings using Bessel-spherical harmonic outer products (8 Bessel bases, Lmax=3)
+- Enhanced stability through a Huber loss and a ZBL pair repulsion term added to forces
+- Models available with both unlimited-neighbor and 20-neighbor-maximum configurations
+- New confidence head providing intrinsic uncertainty estimates for predictions
+
+#### Advice / Caveats
+- Consider using `orb-v3-conservative-120-omat` for initial testing, specifying `precision='float32-highest'` when loading the model. This is the most computationally expensive but accurate configuration. If this level of accuracy meets your needs, then other models and precisions can be investigated to improve speed and system-size scalability.
+- We do not advise using the `-mpa` models unless they are required for compatibility with benchmarks (for example, Matbench Discovery). They are generally less performant.
+- Orb-v3 models are **compiled** by default and use PyTorch's dynamic batching, which means that they do not need to recompile as graph sizes change. However, the first call to the model will be slower, as the graph is compiled by torch.
+
+### [V2 Models](https://arxiv.org/abs/2410.22570)
+
+- `orb-v2` - trained on [MPTraj](https://figshare.com/articles/dataset/Materials_Project_Trjectory_MPtrj_Dataset/23713842?file=41619375) + [Alexandria](https://alexandria.icams.rub.de/).
+- `orb-mptraj-only-v2` - trained on the MPTraj dataset only, to reproduce our second Matbench Discovery result. We do not recommend this model for general use.
+- `orb-d3-v2` - trained on MPTraj + Alexandria with integrated D3 corrections. In general, we recommend using this model, particularly for systems where dispersion interactions are important. This model was trained to predict D3-corrected targets and hence is the same speed as `orb-v2`. Incorporating D3 into the model like this is substantially faster than using analytical D3 corrections.
+- `orb-d3-{sm,xs}-v2` - smaller versions of `orb-d3-v2`. The `sm` model has 10 layers, whilst the `xs` model has 5 layers.
+
+#### Features
+- v2 models use a smoothed cosine distance cutoff for the attention mechanism, ensuring a continuous PES.
+- The force predictions are now net-zero, making them much more stable for MD simulations.
+- The models are generally more accurate (a 2-3% increase on the Matbench Discovery dataset).
+
+### [V1 Models](https://arxiv.org/abs/2410.22570)
+
+Our initial release. These models achieved state-of-the-art performance on the Matbench Discovery dataset at the time of release, but have since been superseded and removed.
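As a quick illustration of the `orb-v3-X-Y-Z` naming convention documented above, a model name can be split into its three fields with a small standalone helper. The `parse_orb_v3_name` function below is a hypothetical sketch for illustration, not part of the orb-models package:

```python
def parse_orb_v3_name(name: str) -> dict:
    """Split an 'orb-v3-X-Y-Z' model name into its documented fields.

    X: 'direct' or 'conservative' (how forces/stress are computed)
    Y: max neighbors per atom, '20' or 'inf'
    Z: training dataset, 'omat' or 'mpa'
    """
    parts = name.split("-")
    if parts[:2] != ["orb", "v3"] or len(parts) != 5:
        raise ValueError(f"not an orb-v3-X-Y-Z model name: {name!r}")
    model_type, neighbors, dataset = parts[2], parts[3], parts[4]
    if model_type not in ("direct", "conservative"):
        raise ValueError(f"unknown model type: {model_type!r}")
    if neighbors not in ("20", "inf"):
        raise ValueError(f"unknown neighbor limit: {neighbors!r}")
    if dataset not in ("omat", "mpa"):
        raise ValueError(f"unknown dataset: {dataset!r}")
    return {"type": model_type, "max_neighbors": neighbors, "dataset": dataset}
```

Note that the OrbMol names (`orb-v3-conservative-omol`, `orb-v3-direct-omol`) omit the neighbor field, since both have `inf` neighbors, so they deliberately fall outside this parser.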

README.md

Lines changed: 35 additions & 62 deletions
@@ -27,61 +27,21 @@ Alternatively, you can use Docker to run orb-models; [see instructions below](#d
 
 ### Updates
 
-**April 2025**: We have released the [Orb-v3 set of potentials](https://arxiv.org/abs/2504.06231). These models improve substantially over Orb-v2, in particular:
+**August 2025**: Release of the OrbMol potentials (blog post forthcoming).
 
-- Model compilation using PyTorch 2.6.0+, enabling faster inference while maintaining support for dynamic graph sizes
-- Wider architecture (1024 vs 512) with fewer layers (5 vs 15) compared to v2, resulting in 2-3x faster performance with similar parameter count
-- Two variants available: direct models and conservative models (forces/stress computed via backpropagation)
-- Trained on the larger, more diverse OMat24 dataset
-- Improved edge embeddings using Bessel-Spherical Harmonic outer products (8 Bessel bases, Lmax=3)
-- Enhanced stability through Huber loss and a ZBL pair repulsion term added to forces
-- Models available with both unlimited neighbors and 20-neighbor maximum configurations
-- New confidence head providing intrinsic uncertainty estimates for predictions
+* Trained on the [Open Molecules 2025 (OMol25)](https://arxiv.org/pdf/2505.08762) dataset: over 100M high-accuracy DFT calculations (ωB97M-V/def2-TZVPD) on diverse molecular systems, including metal complexes, biomolecules, and electrolytes.
+* Architecturally similar to the highly performant Orb-v3 models, but now explicit total charges and spins can be passed as input.
+* To get started with these models, see: [How to specify total charge and spin for OrbMol](#how-to-specify-total-charge-and-spin-for-orbmol).
 
+**April 2025**: Release of the [Orb-v3 set of potentials](https://arxiv.org/abs/2504.06231).
 
-**Oct 2024**: We have released the [Orb-v2 set of potentials](https://arxiv.org/abs/2410.22570). These models have two major changes:
-- v2 models use a smoothed cosine distance cutoff for the attention mechanism. This is a more physically motivated cutoff that is better suited for MPNNs.
-- The force predictions now have net zero forces, meaning they are much more stable for MD simulations.
-- The models are generally more accurate (Increase in 2-3% on the matbench discovery dataset).
+**Oct 2024**: Release of the [Orb-v2 set of potentials](https://arxiv.org/abs/2410.22570).
 
-These models are substantially better for all use cases, so we have removed the v1 models from the new orb-models package. To load the v1 models, please install the v0.3.2 version of orb-models.
+**Sept 2024**: Release of v1 models, with state-of-the-art performance on the Matbench Discovery dataset at the time.
 
-**Sept 2024**: v1 models released - state of the art performance on the matbench discovery dataset.
 
-
-### Pretrained models
-
-We provide several pretrained models that can be used to calculate energies, forces & stresses of atomic systems. All models are provided in the `orb_models.forcefield.pretrained` module.
-
-#### V3 Models
-V3 models use the following naming convention:
-
-```orb-v3-X-Y-Z```
-
-where:
-- `X`: Model type (`direct` or `conservative`) - determines how forces/stress are computed
-- `Y`: Maximum neighbors per atom (`20` or `inf`)
-- `Z`: Training dataset (`omat` or `mpa`)
-
-For example, `orb-v3-conservative-inf-omat` is a model that:
-- Computes forces/stress as gradients of energy
-- Has effectively infinite neighbors (120 in practice)
-- Was trained on the OMat24 dataset
-
-
-Orb-v3 models are **compiled** by default and use Pytorch's dynamic batching, which means that they do not need to recompile as graph sizes change. However, the first call to the model will be slower, as the graph is compiled by torch.
-
-
-**We suggest using models trained on OMAT24**, as these models are more performant and the data they are trained on uses newer pseudopotentials in VASP (PBE54 vs PBE52)*. `-mpa` models should be used if compatability with benchmarks (for example, Matbench Discovery) is required.
-
-#### V2 Models
-
-- `orb-v2` - trained on [MPTraj](https://figshare.com/articles/dataset/Materials_Project_Trjectory_MPtrj_Dataset/23713842?file=41619375) + [Alexandria](https://alexandria.icams.rub.de/).
-- `orb-mptraj-only-v2` - trained on the MPTraj dataset only to reproduce our second Matbench Discovery result. We do not recommend using this model for general use.
-- `orb-d3-v2` - trained on MPTraj + Alexandria with integrated D3 corrections. In general, we recommend using this model, particularly for systems where dispersion interactions are important. This model was trained to predict D3-corrected targets and hence is the same speed as `orb-v2`. Incorporating D3 into the model like this is substantially faster than using analytical D3 corrections.
-- `orb-d3-{sm,xs}-v2` - Smaller versions of `orb-d3-v2`. The `sm` model has 10 layers, whilst the `xs` model has 5 layers.
-
-For more information on the models, please see the [MODELS.md](MODELS.md) file.
+### Available models
+See [MODELS.md](MODELS.md) for a full list of available models along with guidance.
 
 
### Usage
@@ -104,11 +64,9 @@ orbff = pretrained.orb_v3_conservative_inf_omat(
 )
 atoms = bulk('Cu', 'fcc', a=3.58, cubic=True)
 graph = atomic_system.ase_atoms_to_atom_graphs(atoms, orbff.system_config, device=device)
-atoms = bulk('Cu', 'fcc', a=3.58, cubic=True)
-graph = atomic_system.ase_atoms_to_atom_graphs(atoms, orbff.system_config, device=device)
 
-# Optionally, batch graphs for faster inference
-# graph = batch_graphs([graph, graph, ...])
+# If you have several graphs, batch them like so:
+# graph = batch_graphs([graph1, graph2, ...])
 
 result = orbff.predict(graph, split=False)
 
@@ -160,6 +118,28 @@ print("Optimized Energy:", atoms.get_potential_energy())
 
 Or you can use it to run MD simulations. The script, an example input xyz file and a Colab notebook demonstration are available in the [examples directory.](./examples) This should work with any input; simply modify the input_file and cell_size parameters. We recommend using constant-volume simulations.
 
+#### How to specify total charge and spin for OrbMol
+
+The OrbMol models *require* total charge and spin to be specified. This can be done by setting them in the `atoms.info` dictionary.
+
+```python
+import ase
+from ase.build import molecule
+from orb_models.forcefield import atomic_system, pretrained
+from orb_models.forcefield.base import batch_graphs
+
+device = "cpu"  # or device="cuda"
+orbff = pretrained.orb_v3_conservative_omol(
+    device=device,
+    precision="float32-high",  # or "float32-highest" / "float64"
+)
+atoms = molecule("C6H6")
+atoms.info["charge"] = 1.0  # total charge
+atoms.info["spin"] = 0.0  # total spin
+graph = atomic_system.ase_atoms_to_atom_graphs(atoms, orbff.system_config, device=device)
+
+result = orbff.predict(graph, split=False)
+```
 
 #### Confidence head (Orb-v3 Models Only)
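Since OrbMol models require `charge` and `spin` to be present in `atoms.info`, a pre-flight check can fail fast before graph construction. The `require_charge_and_spin` helper below is an illustrative sketch written against a plain dict, not part of the orb-models API:

```python
def require_charge_and_spin(info: dict) -> tuple:
    """Validate that an atoms.info-style dict carries total charge and spin.

    OrbMol models require both keys; raise early with a clear message
    rather than failing deep inside graph construction.
    """
    missing = [key for key in ("charge", "spin") if key not in info]
    if missing:
        raise KeyError(f"atoms.info is missing required keys: {missing}")
    return float(info["charge"]), float(info["spin"])
```

Calling this on `atoms.info` right before `ase_atoms_to_atom_graphs` gives a clearer error than whatever the library would raise internally.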

@@ -216,16 +196,9 @@ The dataset should be an [ASE sqlite database](https://wiki.fysik.dtu.dk/ase/ase
 ```python
 python finetune.py --dataset=<dataset_name> --data_path=<your_data_path> --base_model=<base_model>
 ```
-Where base_model is one of:
-- "orb_v3_conservative_inf_omat"
-- "orb_v3_conservative_20_omat"
-- "orb_v3_direct_inf_omat"
-- "orb_v3_direct_20_omat"
-- "orb_v2"
-
-After the model is finetuned, checkpoints will, by default, be saved to the ckpts folder in the directory you ran the finetuning script from.
+Where base_model is an element of `orb_models.forcefield.pretrained.ORB_PRETRAINED_MODELS.keys()`.
 
-You can use the new model and load the checkpoint by:
+After the model is finetuned, checkpoints will, by default, be saved to the ckpts folder in the directory you ran the finetuning script from. You can use the new model and load the checkpoint by:
 ```python
 from orb_models.forcefield import pretrained

finetune.py

Lines changed: 3 additions & 2 deletions
@@ -259,12 +259,13 @@ def run(args):
     wandb.define_metric("step")
     wandb.define_metric("finetune_step/*", step_metric="step")
 
+    graph_targets = ["energy", "stress"] if model.has_stress else ["energy"]
     loader_args = dict(
         dataset_name=args.dataset,
         dataset_path=args.data_path,
         num_workers=args.num_workers,
         batch_size=args.batch_size,
-        target_config={"graph": ["energy", "stress"], "node": ["forces"]},
+        target_config={"graph": graph_targets, "node": ["forces"]},
     )
     train_loader = build_train_loader(
         **loader_args,

@@ -379,7 +380,7 @@ def main():
     )
     parser.add_argument(
         "--lr",
-        default=3e-04,
+        default=3e-4,
         type=float,
         help="Learning rate. 3e-4 is purely a sensible default; you may want to tune this for your problem.",
     )
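The first hunk above makes the stress target conditional on the model's stress support. Isolated as a pure function (the name `build_target_config` is illustrative, not the repository's API), the logic looks like this:

```python
def build_target_config(has_stress: bool) -> dict:
    """Build a finetuning target config for orb-style models.

    Graph-level targets only include stress when the model can
    actually predict it; node-level force targets are always used.
    """
    graph_targets = ["energy", "stress"] if has_stress else ["energy"]
    return {"graph": graph_targets, "node": ["forces"]}
```

This mirrors the diff: models without a stress head no longer request stress labels from the data loader.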

internal/check.py

Lines changed: 0 additions & 63 deletions
This file was deleted.

0 commit comments
