Commit c172557

committed: update the JOSS paper (#1926)
1 parent fbec6d8 commit c172557

File tree

2 files changed (+86, -178 lines)


paper/paper.bib

Lines changed: 34 additions & 11 deletions
@@ -1065,14 +1065,37 @@ @article{nn_astro_2022
 }
 
 @article{dnn_astro_2025,
-title={Deep Neural Networks for Modeling Astrophysical Nuclear reacting flows},
-author={{Zhang}, Xiaoyu and {Yi}, Yuxiao and {Wang}, Lile and {Xu}, Zhi-Qin John and {Zhang}, Tianhan and {Zhou}, Yao},
-year={2025},
-month={April},
-eprint={2504.14180},
-archivePrefix={arXiv},
-primaryClass={astro-ph.IM},
-url={https://arxiv.org/abs/2504.14180},
-journal={arXiv e-prints},
-keywords={Astrophysics - Instrumentation and Methods for Astrophysics}
-}
+doi = {10.3847/1538-4357/adf331},
+url = {https://doi.org/10.3847/1538-4357/adf331},
+year = {2025},
+month = {sep},
+publisher = {The American Astronomical Society},
+volume = {990},
+number = {2},
+pages = {105},
+author = {Zhang, Xiaoyu and Yi, Yuxiao and Wang, Lile and Xu, Zhi-Qin John and Zhang, Tianhan and Zhou, Yao},
+title = {Deep Neural Networks for Modeling Astrophysical Nuclear Reacting Flows},
+journal = {The Astrophysical Journal},
+abstract = {In astrophysical simulations, nuclear reacting flows
+pose computational challenges due to the stiffness
+of reaction networks. We introduce neural
+network-based surrogate models using the DeePODE
+framework to enhance simulation efficiency while
+maintaining accuracy and robustness. Our method
+replaces conventional stiff ordinary differential
+equation (ODE) solvers with deep learning models
+trained through evolutionary Monte Carlo sampling
+from zero-dimensional simulation data, ensuring
+generalization across varied thermonuclear and
+hydrodynamic conditions. Tested on 3-species and
+13-species reaction networks, the models achieve ≲1\%
+accuracy relative to semi-implicit numerical
+solutions and deliver a ∼2.6× speedup on CPUs. A
+temperature-thresholded deployment strategy ensures
+stability in extreme conditions, sustaining neural
+network utilization above 75\% in multidimensional
+simulations. These data-driven surrogates
+effectively mitigate stiffness constraints, offering
+a scalable approach for high-fidelity modeling of
+astrophysical nuclear reacting flows.}
+}

paper/paper.md

Lines changed: 52 additions & 167 deletions
@@ -85,7 +85,7 @@ affiliations:
 name: Lawrence Berkeley National Laboratory, Berkeley, CA, USA
 - index: 8
 name: Leiden Observatory, Leiden, The Netherlands
-- index: 8
+- index: 9
 name: Arizona State University, Tempe, AZ, USA
 - index: 10
 name: Department of Physics and Astronomy, Michigan State University, E. Lansing, MI, USA
@@ -119,16 +119,11 @@ are many astrophysics simulation codes built around the AMReX library,
 with each specializing in different astrophysics phenomena. Each of
 these codes share some common needs. The Microphysics library was
 created to minimize developer effort across these codes and coordinate
-the approach to exascale compute architectures. Microphysics has been
-used for simulations of convective Urca [@Boyd_2025] and X-ray bursts
-[@Guichandut_2024] with MAESTROeX; and for simulations of nova
-[@Smith2025], X-ray bursts [@Harpole_2021], thermonuclear supernovae
-[@Zingale_2024_dd], and convection in massive stars [@Zingale_2024]
-with Castro. This Microphysics library has also enabled recent work
-in astrophysical machine learning to train deep neural networks
-modeling nuclear reactions in [@nn_astro_2022] and [@dnn_astro_2025].
+the approach to exascale compute architectures, in particular, GPU
+support for astrophysical simulation codes.
+
 
-# Project history
+# Design
 
 The Microphysics project started in 2013 as a way to centralize the
 reaction networks and equations of state used by Castro and MAESTRO
@@ -138,45 +133,19 @@ Microphysics, which was an attempt to co-develop microphysics routines
 for the Castro and the Flash [@flash] simulation codes. As interest
 in GPUs grew (with early support added to Microphysics in 2015),
 Castro moved from a mix of C++ and Fortran to pure C++ to take
-advantage of GPU-offloading afforded by the AMReX library and C++
+advantage of GPU-offloading afforded by the AMReX library, and C++
 ports of all physics routines and solvers were added to Microphysics.
-At this point, the development focused solely on AMReX-based codes and
-C++ and the project was formally named the AMReX-Astrophysics
+At this point, the project was formally named the AMReX-Astrophysics
 Microphysics library. Today, the library is completely written in C++
 and relies heavily on the AMReX data structures to take advantage of
 GPUs. The GPU-enabled reaction network integrators led to the Quokka
 code adopting Microphysics for their simulations.
 
-# Design
-
 Microphysics provides several different types of physics: equations of
 state, reaction networks and screening methods, nuclear statistical
 equilibrium solvers and tabulations, thermal conductivities, and
 opacities, as well as the tools needed to work with them, most notably
 the suite of stiff ODE integrators for the networks.
-
-There are two ways to use Microphysics: in a standalone fashion (via
-the unit tests) for simple investigations or as part of an
-(AMReX-based) application code. In both cases, the core
-(compile-time) requirement is to select a network---this defines the
-composition that is then used by most of the other physics routines.
-
-Microphysics uses header-only implementations of all functionality as
-much as possible, to allow for easier compiler inlining. Generally,
-the physics routines and solvers are written to work on a single zone
-from a simulation code, and in AMReX, a C++ lambda-capturing approach
-is used to loop over zones (and offload to GPUs if desired). We also
-leverage C++17 `if constexpr` templating to compile out unnecessary
-computations for performance. For example, our equations of state can
-compute a lot of thermodynamic quantities and derivatives, but for
-some operations, we only need a few of these. All of the equations of
-state are templated on the `struct` that holds the thermodynamic
-state. If we pass the general `eos_t` type into the EOS, then
-everything is calculated, but if we pass in to the same interface the
-smaller `eos_re_t` type, then only a few energy terms are computed
-(those that are needed when finding temperature from specific internal
-energy).
-
 Several classic Fortran libraries have been converted to header-only
 C++ implementations, including the VODE integrator [@vode], the hybrid
 Powell method of MINPACK [@powell], and the Runge-Kutta Chebyshev
@@ -188,146 +157,62 @@ We also make use of the C++ autodiff library [@autodiff] to compute
 thermodynamic derivatives required in the Jacobians of our reaction
 networks.
 
+Microphysics uses header-only implementations of all functionality as
+much as possible to allow for easier compiler inlining, which is
+especially important in GPU kernels. We also leverage C++17 `if
+constexpr` templating to compile out unnecessary computations for
+performance. Generally, the physics routines and solvers are written
+to work on a single zone from a simulation code, and in AMReX, a C++
+lambda-capturing approach is used to loop over zones (and offload to
+GPUs if desired). When used with an application code, this design
+permits the simulation state data to be allocated directly in GPU
+memory and left there for the entire simulation, with all physics run
+directly on the GPU. Since each zone in a simulation usually will
+have a different thermodynamic state, the integration of reaction
+networks can lead to thread divergence issues. To help mitigate this issue, we can
+cap the number of integration steps and either retry an integration on
+a zone-by-zone basis with different tolerances or Jacobian
+approximations or pass the failure back to the application code to
+deal with. This strategy has been successful for many large scale
+simulations [@Zingale_2025].
+
+
 Another key design feature is the separation of the reaction network
 from the integrator. This allows us to easily experiment with
 different integration methods (such as the RKC integrator) and also
 support different modes of coupling reactions to a simulation code,
 including operator splitting and spectral deferred corrections (SDC)
 (see, e.g., @castro_simple_sdc). The latter is especially important
-for explosive astrophysical flows.
+for explosive astrophysical flows. Tight integration with pynucastro [@pynucastro; @pynucastro2], allows for the generation of custom reaction networks for a science problem.
 
-Finally, most of the physics is chosen at compile-time. This allows
+There are two ways to use Microphysics: in a standalone fashion (via
+the unit tests) for simple investigations or as part of an
+(AMReX-based) application code. In both cases, the core
+(compile-time) requirement is to select a network---this defines the
+composition that is then used by most of the other physics routines.
+This compile-time requirement also allows
 Microphysics to provide the number of species as a `constexpr` value
-(which many application codes need), and also greatly reduces the
+(which many application codes need), and greatly reduces the
 compilation time (due to the templating used throughout the library).
 
-# Capabilities
-
-## Reaction networks
-
-A reaction network defines the composition (including the atomic
-weight and number) and the reactions that link the nuclei together.
-Even if reactions are not being modeled, a `general_null` network can
-be used to simply define the composition.
-
-In multidimensional simulations, there is a desire to make the
-reaction as small as possible (due to the memory and per-zone
-computational costs) while still being able to represent the
-nucleosynthesis reasonable accurately. As a result, approximations
-to rates are common and a wide variety of networks are used depending
-on the burning state being modeled.
-
-We have ported many of the classic "aprox" networks used in the
-astrophysics community (for example "aprox21" described in
-@wallacewoosley:1981) to C++. Many of these originated from the
-implementations of @cococubed. Our implementation relies heavily on
-C++ templates, allowing us to simply define the properties of the
-reactions and then the compiler builds the righthand side and Jacobian
-of the system at compile-time. This reduces the maintenance costs of
-the networks and also eliminates some common indexing bugs.
-
-We also integrate with the pynucastro nuclear astrophysics library
-[@pynucastro; @pynucastro2], allowing us to generate a custom network
-in a few lines of python simply by specifying the nuclei we want. This
-makes use of the reaction rates from @ReacLib and others, and allows us
-to keep up to date with changes in rates and build more complex networks
-than the traditional aprox nets.
-
-
-### Screening
-
-Nuclear reaction rates are screened by the electrons in the plasma
-(which reduce the Coulomb barrier for the positively charged nuclei to
-fuse). Microphysics provides several different screening
-implementations: the widely-used `screen5` method based on
-@graboske:1973; @jancovici:1977; @alastuey:1978; @itoh:1979, the
-methods of @chugunov:2007 and @chugunov:2009, and the method of
-@Chabrier_1998.
-
-
-### Nuclear statistical equilibrium
-
-At high temperatures ($T > 4\times 10^9~\mathrm{K}$), forward and
-reverse reactions can come into equilibrium (nuclear statistical
-equilibrium, NSE). Integrating the reaction network directly in this
-regime can be difficult, since the large, but oppositely signed rates,
-may not cancel exactly. In this case, instead of integrating the
-network, we can impose the equilibrium state. Microphysics has two
-different approaches to NSE: a self-consistent solve for the NSE state
-using the nuclei in the present reaction network (similar to
-@Kushnir_2020) and an interpolation from a tabulated NSE state that
-was generated with $\mathcal{O}(100)$ nuclei (see @Zingale_2024).
-
-### Thermal neutrinos
-
-There are a number of thermal mechanisms for producing neutrinos,
-including plasma, photo, pair, recombination, and Bremsstrahlung
-neutrinos. These act as an energy loss term to the reaction network
-and are implemented following @itoh:1996.
-
-
-
-
-## Equations of state
-
-The equations of hydrodynamics are closed via an equation of state
-that related internal energy, pressure, and density (along with
-composition). For systems with reactions or thermal diffusion, it
-also provides temperature. Traditionally, equations of state are
-implemented in terms of density and temperature, so a Newton-Raphson
-method is used to invert the EOS given energy and density (or some
-other thermodynamic quantities). A wide range of thermodynamic
-quantities are needed by simulation codes, including pressure,
-internal energy, enthalpy, entropy, and their derivatives with
-respect to density, temperature, and composition. The various EOS
-`struct` types carry this thermodynamic state.
-
-A variety of EOSs are implemented, to allow for application to a range
-of problems. These include a simple gamma-law EOS, the stellar EOS of
-@timmes:2000, and an equation of state applicable to primordial
-chemistry.
-
-## Transport coefficients
-
-For thermal diffusion or radiation transport, conductivities and
-opacities are needed. We provide a C++ port of the stellar
-conductivity opacities from @timmes:2000b. These are appropriate for
-modeling thermonuclear flames in supernovae and X-ray bursts.
-
-# GPU Strategy
-
-Microphysics is designed such that all computation takes place on
-GPUs. When used with an application code, this permits the simulation
-state data to be allocated directly in GPU memory and left there for
-the entire simulation. For the ODE integration, the integrator
-(e.g. VODE) is run on the GPU directly. Since each zone in a
-simulation usually will have a different thermodynamic state, this can
-lead to thread divergence issues, since some zones will have an easier
-burn than others. To help mitigate this issue, we can cap the number
-of integration steps and either retry an integration on a zone-by-zone
-basis with different tolerances or Jacobian approximations or pass the
-failure back to the application code to deal with. This strategy
-has been successful for many large scale simulations [@Zingale_2025].
-
-
-# Unit tests / examples
-
-Microphysics can be used as a standalone tool through the tests
-in `Microphysics/unit_test/`. There are 2 types of tests here:
-
-* *comprehensive tests*: these test performance by setting up a cube
-of data (with density, temperature, and composition varying in a
-dimension) and performing an operation on the entire cube (calling
-the EOS, integrating a network, ...). A separate test is provided
-for each major physics module.
-
-* *one-zone tests*: these simply call one of the physics modules with
-a single thermodynamic state. This can be used to explore the
-physics that is implemented, and also serve to demonstrate the interfaces
-used in Microphysics.
+# Research Impact Statement
+
+Microphysics has been used for simulations of convective Urca
+[@Boyd_2025] and X-ray bursts [@Guichandut_2024] with MAESTROeX; and
+for simulations of nova [@Smith2025], X-ray bursts [@Harpole_2021],
+thermonuclear supernovae [@Zingale_2024_dd], and convection in massive
+stars [@Zingale_2024] with Castro. This Microphysics library has also
+enabled recent work in astrophysical machine learning to train deep
+neural networks modeling nuclear reactions [@nn_astro_2022; @dnn_astro_2025].
+
+# AI Usage Disclosure
+
+No generative AI/LLM was used for producing code or documentation in
+the git repository or for this paper. We have experimented with using
+AI/LLM tools for code review and for suggesting places to focus our
+optimization efforts on, but the resulting coding, benchmarking, and
+testing is then done by humans.
 
-These tests also serve as tutorial codes for integrating Microphysics
-into new application codes.
 
 # Acknowledgements
 