Transolver volume #1242

coreyjadams · 2025-11-20T22:27:47Z

PhysicsNeMo Pull Request

This PR brings several updates:

The Transolver datapipe has been simplified and largely merged with DoMINO's. This is a great performance boost.
The Transolver example supports volumetric training.
The Transolver++ model is, optionally, supported.

Still minor details to wrap up:

Add plots and training results to the README for transolver.
Update the changelog.
Ensure the inference_on_zarr script works on volumetric data
Update or deprecate the inference on stl script, since we can evaluate R2, etc., right from zarr.

Description

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.
[] The CHANGELOG.md is up to date with these changes.
An issue is linked to this pull request.

Dependencies

Review Process

All PRs are reviewed by the PhysicsNeMo team before merging.

Depending on which files are changed, GitHub may automatically assign a maintainer for review.

We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.

AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.

…taset. Surface dataloading is prototyped, not finished yet.

… data. Applied some cleanup to make the datapipe similar to domino, which is a step towards unification.

…to a flat LR.

greptile-apps · 2025-11-20T22:30:21Z

Greptile Overview

Greptile Summary

Overview

This PR introduces significant enhancements to the PhysicsNeMo transolver capabilities:

New Typhon Model: Geometry-Aware Physics Attention transformer extending Transolver with cross-attention to geometry and global context
Volumetric Training Support: Transolver now supports both surface and volumetric training modes
Unified Datapipe: Merged Transolver and DoMINO datapipes for improved performance and consistency
Transolver++ Features: Optional support for advanced features including gumbel softmax and learnable temperature

Critical Issue

Typhon Model Bug (physicsnemo/experimental/models/typhon/typhon.py:809-816): The embedding_states variable is referenced without being assigned when neither geometry nor global embeddings are provided. This will cause a runtime UnboundLocalError when the model is instantiated without optional context inputs.

Architecture Changes

The datapipe refactoring consolidates preprocessing logic and improves flexibility with configurable symmetries (translational invariance, scale invariance). The new CAEDataset enhancements support reading zarr attributes and more flexible volume key detection.

Checklist Status

Per the PR description, the following items remain incomplete:

CHANGELOG.md update pending
Training results/plots for README pending
Basic unit tests for Typhon pending (acknowledged as acceptable for experimental)

Important Files Changed

File Analysis

Filename	Score	Overview
physicsnemo/experimental/models/typhon/typhon.py	2/5	New Typhon model with critical bug: `embedding_states` undefined when no geometry/global embeddings provided
physicsnemo/datapipes/cae/transolver_datapipe.py	5/5	New unified datapipe merging Transolver and DoMINO pipelines for surface/volume training
physicsnemo/models/transolver/Physics_Attention.py	5/5	Enhanced with Transolver++ support (gumbel_softmax) and optional transformer-engine imports
examples/cfd/external_aerodynamics/transolver/src/train.py	5/5	Unified training script supporting both Transolver and Typhon with FP8/mixed precision
physicsnemo/datapipes/cae/cae_dataset.py	5/5	Enhanced to support zarr attributes and more flexible volume key detection

greptile-apps

_{22 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2025-11-20T22:30:20Z

physicsnemo/experimental/models/typhon/typhon.py

+        if len(global_context_input) > 0:
+            embedding_states = torch.cat(global_context_input, dim=-1)
+
+        # Project the inputs to the hidden dimension:
+        x = self.preprocess(local_embedding)
+
+        for block in self.blocks:
+            x = block(x, embedding_states)


logic: embedding_states referenced before assignment when no geometry or global embeddings provided

Suggested change

if len(global_context_input) > 0:

embedding_states = torch.cat(global_context_input, dim=-1)

# Project the inputs to the hidden dimension:

x = self.preprocess(local_embedding)

for block in self.blocks:

x = block(x, embedding_states)

# Construct the embedding states:

if len(global_context_input) > 0:

embedding_states = torch.cat(global_context_input, dim=-1)

else:

embedding_states = None

# Project the inputs to the hidden dimension:

x = self.preprocess(local_embedding)

for block in self.blocks:

x = block(x, embedding_states)

RishikeshRanade

LGTM

RishikeshRanade · 2025-11-21T17:49:04Z

examples/cfd/external_aerodynamics/transolver/README.md

+inference workloads are different, so these aim to cover common scenarios as examples. -->
+
+The validation dataset in Zarr format can be loaded, processed, and the L2
+metrics summarized in `inference_on_zarr.py`.  For surface data, this script will also


Can you add that return_mesh_neighbors should be set to true to run this?

RishikeshRanade · 2025-11-21T17:52:56Z

examples/cfd/external_aerodynamics/transolver/src/metrics.py


    metrics = {
-        "l2_pressure": torch.mean(l2[:, 0]),
+        "l2_pressure_surf": torch.mean(l2[:, 0]),


In the readme we need to mention that this part needs to be changed when extending to a different use case. The other way is to describe variables in config.yaml like domino and read them from there.

RishikeshRanade · 2025-11-21T17:56:29Z

examples/cfd/external_aerodynamics/transolver/src/train.py



-def pad_input_for_fp8(features: torch.Tensor, embeddings: torch.Tensor) -> torch.Tensor:
+def pad_input_for_fp8(


Do these need to be part of train.py? Can these functions be moved to utils or part of datapipe?

RishikeshRanade · 2025-11-21T17:57:06Z

examples/cfd/external_aerodynamics/transolver/src/train.py

        dataloader: Training data loader
-        sampler (torch.utils.data.Sampler): Sampler for distributed or sequential sampling.
        model (torch.nn.Module): The neural network model to train.
+        epoch_len (int): Length of the epoch.


Is this number of epochs?

RishikeshRanade · 2025-11-21T18:11:41Z

examples/cfd/external_aerodynamics/transolver/README.md

+[2025-11-19 07:02:38,387][training][INFO] - Summary:
+| Batch   |   Loss |   L2 Pressure |   L2 Shear X |   L2 Shear Y |   L2 Shear Z |   Predicted Drag Coefficient |   Pred Lift Coefficient |   True Drag Coefficient |   True Lift Coefficient |   Elapsed (s) |
+|---------|--------|---------------|--------------|--------------|--------------|------------------------------|-------------------------|-------------------------|-------------------------|---------------|
+| Mean    | 0.0311 |        0.0614 |       0.0921 |        0.108 |       0.1214 |                       5.2949 |                  1.9137 |                  5.2962 |                  1.9329 |       11.4647 |


Is this updated? These numbers look higher than what we discussed, right?

coreyjadams added 30 commits October 1, 2025 19:20

Implement transolver ++ physics attention

6f5b416

Enable ++ in Transolver.

dcd0841

Merge branch 'NVIDIA:main' into transolver_plus

f8d7736

Merge branch 'NVIDIA:main' into transolver_plus

58d1325

Fix temperature correction terms.

3f9d618

Merge branch 'NVIDIA:main' into transolver_plus

9d76330

Merge branch 'NVIDIA:main' into transolver_plus

bf82635

Merge branch 'NVIDIA:main' into transolver_plus

c4ed105

Starting work adapting the domino datapipe techniques to transolver.

e97c6d2

Working towards transolver volume training by mergeing with domino da…

987e502

…taset. Surface dataloading is prototyped, not finished yet.

Merge branch 'NVIDIA:main' into transolver-volume

98d0f76

Updating

d734cd7

Remove printout

606d1ef

Enable transolver for volumetric data

53c72c1

Update transolver training script to support either surface or volume…

9f9b256

… data. Applied some cleanup to make the datapipe similar to domino, which is a step towards unification.

Updating datapipe

713f2a9

Tweak transolver volume configs

908fb52

Merge branch 'main' into transolver-volume

9bd97fa

Merge branch 'main' into transolver_plus

20e00bd

Merge branch 'NVIDIA:main' into transolver_plus

9780219

Merge branch 'NVIDIA:main' into transolver-volume

27c2cec

Add transolverX model

765dd9e

Merge branch 'NVIDIA:main' into transolver-volume

31df3bd

Merge branch 'NVIDIA:main' into transolver-volume

38a500c

Merge branch 'NVIDIA:main' into transolver_plus

22af035

Enable nearly-uniform sampling of very very large arrays

155568e

limit benchmarking to train epoch, enable profiler in config

af37fd8

Update volume config slightly

abc86a7

Merge branch 'transolver_plus' into transolver-volume

6860bea

Update training scripts to properly enable data preloading

926587f

coreyjadams added 13 commits November 5, 2025 09:37

Merge branch 'NVIDIA:main' into transolver-volume

6664323

Working towards adding a muon optimzier in transolver

1bc4af6

Add peter's implementation of muon with a combined optimizer. switch …

f7d1739

…to a flat LR.

Merge branch 'NVIDIA:main' into transolver-volume

56af9dd

Merge branch 'NVIDIA:main' into transolver-volume

4c1ebea

Merge branch 'NVIDIA:main' into transolver-volume

dc386ef

Add updated inference script that can also calculate drag and lift

c72a6b3

Add better docstrings for typhon

a795048

Move typhon to experimental

83ed709

Move forwards docstring

d4f2436

Adding typhon model and configs.

524748e

Update readme.

751ff68

Merge branch 'main' into transolver-volume

dedd507

greptile-apps bot reviewed Nov 20, 2025

View reviewed changes

RishikeshRanade approved these changes Nov 21, 2025

View reviewed changes

RishikeshRanade reviewed Nov 21, 2025

View reviewed changes

coreyjadams changed the title ~~Transolver volume + Typhon Model~~ Transolver volume Nov 21, 2025

Merge branch 'main' into transolver-volume

88dd3be

bstaber mentioned this pull request Nov 25, 2025

🐛[BUG]: Can't use Transolver model without transformer_engine #1248

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Transolver volume #1242

Transolver volume #1242

Uh oh!

coreyjadams commented Nov 20, 2025 •

edited

Loading

Uh oh!

greptile-apps bot commented Nov 20, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot Nov 20, 2025

Uh oh!

RishikeshRanade left a comment

Uh oh!

RishikeshRanade Nov 21, 2025

Uh oh!

RishikeshRanade Nov 21, 2025

Uh oh!

RishikeshRanade Nov 21, 2025

Uh oh!

RishikeshRanade Nov 21, 2025

Uh oh!

RishikeshRanade Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants



		def pad_input_for_fp8(features: torch.Tensor, embeddings: torch.Tensor) -> torch.Tensor:
		def pad_input_for_fp8(

Transolver volume #1242

Are you sure you want to change the base?

Transolver volume #1242

Uh oh!

Conversation

coreyjadams commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PhysicsNeMo Pull Request

Description

Checklist

Dependencies

Review Process

Uh oh!

greptile-apps bot commented Nov 20, 2025

Greptile Overview

Greptile Summary

Overview

Critical Issue

Architecture Changes

Checklist Status

Important Files Changed

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade left a comment

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

RishikeshRanade Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coreyjadams commented Nov 20, 2025 •

edited

Loading