
Adding TorchScalarVariable and TorchNDVariable #140

Open
pluflou wants to merge 29 commits into main from ndvariable

Conversation


@pluflou commented Feb 4, 2026

This pull request introduces several improvements and refactors across the codebase to better support array and tensor variables in LUME models, enhance input/output validation, and simplify utility functions. The main focus is on expanding variable support beyond scalars, improving tensor handling, and streamlining validation logic for model inputs and outputs.

Expanded Variable Support and Validation

  • Updated input_variables and output_variables in LUMEBaseModel to support TorchNDVariable and DistributionVariable, allowing models to handle arrays/tensors and distributions as inputs/outputs.
  • Refactored input validation logic in LUMEBaseModel and derived models to properly validate and handle TorchNDVariable and DistributionVariable instances, raising errors for unknown input names.
  • Improved output validation in TorchModel to differentiate between scalar and tensor outputs, ensuring correct validation for each variable type.

Utility and Model Refactoring

  • Enhanced the itemize_dict utility to support flattening and itemizing both numpy arrays and torch tensors, making it more robust for different input types.
  • Refactored the _arrange_inputs method in TorchModel to support batching and stacking of tensor inputs, handle default values, and enforce consistent input shapes, with clear error handling for mixed variable types.
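
For illustration, a minimal numpy-only sketch of what a flatten-and-itemize utility like itemize_dict might look like (hypothetical, not the actual lume-model implementation; torch tensors expose the same .flatten()/.item() methods, so the same duck-typed path would cover them):

```python
import numpy as np

def itemize_dict(d: dict) -> dict:
    """Flatten array-like values and convert every element to a Python scalar.

    Hypothetical sketch: numpy arrays (and, by the same interface, torch
    tensors) are flattened element by element; plain scalars are wrapped
    in a single-element list.
    """
    out = {}
    for name, value in d.items():
        if hasattr(value, "flatten"):
            out[name] = [el.item() for el in value.flatten()]
        else:
            out[name] = [value]
    return out

itemized = itemize_dict({"a": np.array([[1.0, 2.0], [3.0, 4.0]]), "b": 5.0})
# itemized["a"] -> [1.0, 2.0, 3.0, 4.0]; itemized["b"] -> [5.0]
```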

Miscellaneous Improvements

  • Added a _tkwargs property to ProbModelBase for consistent tensor device/dtype handling, and removed redundant code from GPModel.
  • Improved handling of default values for tensor inputs and outputs, ensuring proper cloning and detaching to avoid unwanted side effects.
  • Updated DistributionVariable.validate_value to correctly use ConfigEnum.NULL for validation configuration, improving clarity and correctness.

These changes collectively make the codebase more flexible for machine learning workflows that require complex input/output types, improve validation reliability, and simplify the handling of tensors and arrays throughout the model lifecycle.


def _arrange_inputs(
    self, formatted_inputs: dict[str, torch.Tensor]
) -> torch.Tensor:
@pluflou (Collaborator Author):

@roussel-ryan can you review this function?
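
For context, a hypothetical numpy analogue of what such a batching-and-stacking step could do (a sketch under stated assumptions, not the function under review; the torch version would use torch.stack the same way): broadcast scalar inputs to the common batch size, then stack features along the last axis.

```python
import numpy as np

def arrange_inputs(formatted_inputs: dict, input_order: list) -> np.ndarray:
    """Stack per-feature inputs into a single (batch_size, n_features) array.

    Hypothetical sketch: each entry is a 0-d scalar or a 1D batch; scalars
    are broadcast to the common batch size, and inconsistent batch sizes or
    higher-dimensional inputs raise a clear error.
    """
    arrays = [np.asarray(formatted_inputs[name]) for name in input_order]
    batch_size = max((a.shape[0] for a in arrays if a.ndim == 1), default=1)
    columns = []
    for name, a in zip(input_order, arrays):
        if a.ndim == 0:
            columns.append(np.full(batch_size, a.item()))  # broadcast scalar
        elif a.ndim == 1:
            if a.shape[0] != batch_size:
                raise ValueError(
                    f"'{name}' has batch size {a.shape[0]}, expected {batch_size}"
                )
            columns.append(a)
        else:
            raise ValueError(f"'{name}' must be 0D or 1D, got ndim={a.ndim}")
    return np.stack(columns, axis=-1)

batch = arrange_inputs({"x": 1.0, "y": np.array([2.0, 3.0])}, ["x", "y"])
# batch.shape -> (2, 2)
```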

if value.ndim == 0:
    pass  # scalar tensor, valid
elif value.ndim == 1:
    pass  # 1D tensor (single scalar or batch of scalars), valid
@roussel-ryan (Collaborator):

should this be valid for a ScalarVariable type?

@pluflou (Collaborator Author):

Currently we pass tensors of 1 dim like torch.tensor([1]) or batches torch.tensor([1, 2, 3]) and we don't want to treat these as NDVariables. Do you have other suggestions on how to validate this?

@roussel-ryan (Collaborator):

I see; in this case we should either check that the last dimension is 1 or that ndim == 0. I don't think a shape (N,) should work. Also, the comment below should read:
# Batched scalars with shape (batch_size, 1), valid
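
Sketching the check suggested here (an illustration only, assuming a numpy-like .ndim/.shape interface; torch tensors behave the same way):

```python
import numpy as np

def is_valid_scalar_shape(value) -> bool:
    """Accept 0-d values, or arrays whose last dimension is 1.

    A bare (N,) shape with N > 1 is rejected, so 1D arrays are not
    silently treated as batches of scalars; batched scalars must use
    shape (batch_size, 1).
    """
    return value.ndim == 0 or value.shape[-1] == 1
```

Usage: is_valid_scalar_shape(np.array(1.0)) and is_valid_scalar_shape(np.ones((5, 1))) pass, while is_valid_scalar_shape(np.ones(3)) fails.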

if expected_dtype and value.dtype != expected_dtype:
    raise ValueError(f"Expected dtype {expected_dtype}, got {value.dtype}")

def _get_image_shape_for_validation(self, value: Tensor) -> Tuple[int, ...]:
@roussel-ryan (Collaborator):

I think this is too restrictive for NDVariable, if we want an ImageVariable subclass then this would be more relevant

@pluflou (Collaborator Author):

So do we want both
NDVariable -> ImageVariable(NDVariable) -> TorchImageVariable(ImageVariable)
and
NDVariable -> TorchNDVariable(NDVariable),
and similar implementations for numpy under lume-base?

@roussel-ryan (Collaborator):

so what does the ImageVariable class add that we can't do with the shape argument in NDVariable? Is it just a convenience wrapper?

@roussel-ryan (Collaborator):

If so I think we would want NDVariable -> TorchNDVariable(NDVariable) -> TorchImageVariable(TorchNDVariable)
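
As an illustration, the hierarchy proposed above might be sketched like this (names, fields, and validation are hypothetical, not the actual lume-model classes):

```python
class NDVariable:
    """Base variable holding an N-dimensional array value (sketch)."""
    def __init__(self, name: str, shape: tuple = None):
        self.name = name
        self.shape = shape  # optional expected shape, e.g. (64, 64)

class TorchNDVariable(NDVariable):
    """NDVariable whose values are torch tensors (sketch)."""

class TorchImageVariable(TorchNDVariable):
    """Convenience wrapper constraining the expected shape to 2D or 3D images."""
    def __init__(self, name: str, shape: tuple):
        if len(shape) not in (2, 3):
            raise ValueError(f"image shape must be 2D or 3D, got {len(shape)}D")
        super().__init__(name, shape)

image = TorchImageVariable("camera", (64, 64))
# isinstance(image, NDVariable) -> True
```

On this reading, TorchImageVariable adds only shape validation on top of TorchNDVariable, which is the "convenience wrapper" question raised above.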

@pluflou (Collaborator Author) commented Feb 12, 2026:

So the main reasons I added specific image type validation were:

  • to validate that the NDVariable is either 2D or 3D only
  • to validate that the torch images are being correctly defined vs let's say numpy images, since torch expects [Channels, Height, Width] but numpy expects [Height, Width, Channels].

Both of these can be removed and we can just use NDVariable, as long as we assume the user is defining images correctly for each case. We expect to be using numpy images on the LCLS side AFAIK.

Or I can add an image subclass as discussed above. Any preference?

@roussel-ryan (Collaborator) commented Feb 12, 2026:

Why do we have channels at all? We deal with greyscale images.

@pluflou (Collaborator Author):

To keep it general. For now maybe it's best to remove image specific checks and keep it as a general NDVariable class, and implement features/validation as needed if specific image use cases come up.

@roussel-ryan (Collaborator):

Yes, I think that makes sense.

@pluflou deleted the branch main February 12, 2026 19:42
@pluflou closed this Feb 12, 2026
@pluflou reopened this Feb 12, 2026
@pluflou changed the base branch from lume-torch to main February 18, 2026 01:03
