
Swap plane Poisson solver with slab Poisson solver in GK IWL simulations & 2x LBO energy conservation#928

Open
manauref wants to merge 61 commits into main from poisson_perp_bias

Conversation

@manauref
Collaborator

@manauref manauref commented Jan 7, 2026

This goes with PR 93 of gkylcas

Problem

Currently the IWL simulations use the Poisson solver on planes (fem_poisson_deflated), which has been shown to break energy conservation. We have continued to do so because:
a) We didn't have the ability to bias the limiter corner in the slab Poisson solver (fem_poisson_perp).
b) We didn't have a working workflow that used the slab Poisson solver with twist-shift BCs (TS BCs).

Here we address both of these points, and deprecate the use of fem_poisson_deflated in favor of fem_poisson_perp.

Solution

Biasing

We implemented the ability to bias the limiter corner in fem_poisson_perp. Some info is in DR #898.

Unlike biasing in fem_poisson, where one can bias whole planes in the solver, in fem_poisson_perp we bias lines. That suffices because at the moment we just need to bias the lines at (x,z)=(x_LCFS,z_min) and (x,z)=(x_LCFS,z_max).

Tests with biasing were added to the fem_poisson_perp unit test.
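The line biasing described above can be illustrated as row replacement in the assembled FEM system. This is a minimal, hypothetical sketch in Python/SciPy of the standard Dirichlet-row technique, not Gkeyll's actual fem_poisson_perp implementation; `bias_nodes` and the 1D Laplacian stand-in are my own illustrative names.

```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def bias_nodes(A_csr, b, nodes, value):
    """Enforce phi[n] = value for every node n by row replacement."""
    A = A_csr.tolil()
    for n in nodes:
        A.rows[n] = [n]       # keep only the diagonal entry in this row
        A.data[n] = [1.0]
        b[n] = value          # the biased (Dirichlet) value goes in the RHS
    return A.tocsr(), b

# 1D Laplacian as a stand-in for the perpendicular Poisson LHS.
N = 8
A = sp.diags([-np.ones(N - 1), 2.0 * np.ones(N), -np.ones(N - 1)],
             [-1, 0, 1], format="csr")
b = np.zeros(N)

# Bias the two end nodes (loosely analogous to the lines at
# (x_LCFS, z_min) and (x_LCFS, z_max)).
A, b = bias_nodes(A, b, [0, N - 1], 0.5)
phi = spla.spsolve(A, b)  # with zero source, phi is the bias value everywhere
```

Row replacement breaks the symmetry of the matrix; symmetric elimination of the biased rows/columns is an alternative when a symmetric solver is required.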

Workflow with TS BCs

We implemented a new set of steps to make sure the potential is twist-shift periodic with this new slab solver. These are:

  1. Smooth the charge density, without BCs (fem_parproj).
  2. Solve the perpendicular Poisson problem (fem_poisson_perp).
  3. In the core, apply TS BC to phi at the lower z-boundary, and a ghost-from-skin-surf BC at the upper z-boundary.
  4. Smooth phi, with Dirichlet BCs that take the Dirichlet value from the ghost in the core, and with Dirichlet BCs that take the Dirichlet value from the skin in the SOL.
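The boundary fill in step 3 can be sketched on a (y,z) slice of phi. This is a hedged illustration only: the real twist-shift BC applies an x-dependent, generally non-integer y shift via interpolation, whereas here an integer `np.roll` stands in for it, and `fill_z_ghosts` is a hypothetical helper, not the app's API.

```python
import numpy as np

def fill_z_ghosts(phi, shift):
    """phi is a (y, z) slice with one ghost cell at each z end.
    Lower-z ghost: twist-shifted copy of the upper skin (TS BC).
    Upper-z ghost: straight copy of the upper skin (ghost-from-skin)."""
    upper_skin = phi[:, -2].copy()
    phi[:, 0] = np.roll(upper_skin, shift)  # stand-in for TS interpolation
    phi[:, -1] = upper_skin
    return phi

ny, nz = 6, 5  # 3 interior z cells plus 2 ghost cells
phi = np.arange(ny * nz, dtype=float).reshape(ny, nz)
phi = fill_z_ghosts(phi, shift=2)
```

Step 4 would then read the Dirichlet values for the z-smoothing from these ghost (core) or skin (SOL) cells.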

Commentary on alternatives

  • Regarding step 1: it's unclear whether BCs are needed here, or what they should be. In SOL simulations we don't apply BCs, and that's the option that makes the operator self-adjoint and preserves energy. Perhaps indirectly using TS BCs, as done in steps 3-4, would be worthwhile, but it doesn't seem necessary so far.
  • Note that steps 3-4 only apply the TS BC at the lower boundary, as is done in main now. We tried, both in main and in this branch, applying the TS BC only at the upper boundary or at both boundaries, but neither works.

Additional perks

We noticed that the LBO collision operator was not conserving energy in 2x. We fixed that in this branch.

This fix is not the final one, though. Through more testing I found scenarios (large gradients + normNu) in which the LBO still doesn't conserve energy. I think we need to compute nu-weighted moments to truly conserve energy, which we may pursue in another branch.
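A hedged sketch of why ν-weighted moments may be needed, in 1V with ν = ν(x) only (the notation here is illustrative, not necessarily what the code computes): when ν varies in space, the primitive moments u and v_t² must satisfy a constraint built from ν-weighted moments for the energy moment of the collision operator to vanish.

```latex
% Energy moment of a 1V Dougherty/LBO operator with velocity-independent nu:
C[f] = \partial_v\!\left[\nu\,(v-u)\,f + \nu\,v_t^2\,\partial_v f\right]
% Integrating by parts (boundary terms dropped):
\int v^2\, C[f]\,dv = 2\int \nu\, v_t^2\, f\,dv - 2\int \nu\, v\,(v-u)\, f\,dv
% With nu-weighted moments M_k^{\nu} = \int \nu\, v^k f\,dv, this vanishes iff
M_2^{\nu} - u\, M_1^{\nu} = v_t^2\, M_0^{\nu}
```

With constant ν this reduces to the usual moment constraint, which is why the problem only shows up with spatially varying ν (e.g. normNu).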

Tests

This branch is valgrind and compute-sanitizer clean.

Regression tests

Regression tests run, but they are expected to yield slightly different solutions since we are using a different algorithm now.

rt_gk_d3d_iwl_2x2v_p1

[screenshots of regression results]

rt_gk_tcv_iwl_adapt_source_2x2v_p1

[screenshots of regression results]

rt_gk_d3d_iwl_3x2v_p1

[screenshots of regression results]

rt_gk_tcv_iwl_adapt_source_3x2v_p1

[screenshots of regression results]

Production simulation

We ran a TCV case provided by @Antoinehoff in both main and this branch. Here are some snapshots.

[screenshots comparing snapshots from main and this branch]

manauref and others added 30 commits November 10, 2025 12:38
…on_perp. Unlike the implementation in fem_poisson, where one can specify an arbitrary (but grid-aligned) biased plane, here we specify biased lines, but for the moment it is restricted to specifying a line parallel to y (perpendicular to x and z). Not yet tested.
…estricted to a line perpendicular to x and z. Add a unit test, which suggests it seems to work (gkylcas 2c7f34696aa65e552572369d203c031edc2146b4).
… Add correct logic for parallel smoothing in IWL to the GK field app. Regression test results are qualitatively and quantitatively similar. A production run will give more confidence.
… a) the 1x and 2x volume kernels were out of date (I didn't change the maxima at all to generate these new kernels), b) When int/corn/surf geo was implemented we used the interior bmag_inv in the GK LBO when we should've used the corner bmag_inv. So here we remove geo_int.bmag_inv (and geo_int.bmag_inv_sq because it's not used anywhere) and add geo_corn.bmag_inv. Now the LBO conserves energy when B(x) and n(x),T(x).
…sed line in fem_poisson_perp; it wasn't working right when the biased line was placed on the domain boundary. It behaves as expected now, but the new logic isn't totally compatible with TS BCs. That's because the second smoothing changes phi at the biased line (it's no longer 0 if the biased value was 0), so the TS BC of phi will have a non-trivial effect there. Perhaps the thing to do is to apply the TS BC to phi before the second smoothing, and use the TS-ed phi as a BC in the smoothing.
…ter the perp solve, apply the TS BC to phi at both boundaries. b) Smooth phi along z with Dirichlet BCs both in the core and the SOL, otherwise the biased limiter value is not preserved (an alternative could be to change fem_parproj so it enforces a biased line value just at the limiter corner in the SOL smoothing).
…values are read from the ghost cell, and adapt the unit test which passes now. Really we should make this an option in the updater because we've gone back and forth on this a couple of times over the last 2 years. For now we simply change this option entirely, for testing (gkylcas 3607635a8b0e4da7d08516021bfe53b72cbd86ab).
…ost cells in the wrong place. We need to fill the ghost cells of the z-global array that goes into fem_parproj. It's currently a bit hacky here, but the simulation appears well behaved. We are presently only applying TS at the lower boundary of the core; if we apply TS on both sides, the solution develops weird gradients in z and the sim crashes in 2-3 microseconds.
…t, and I want to check that it gives similar results (I still don't understand why applying TS at both boundaries behaves so badly).
…ranch with: 1) flattening of the solution in the core ghost before applying TS (not yet tested), 2) some memory fixes in fem_poisson_perp biasing. But I'm seeing cases where biasing works at the bottom but not at the top, and want to switch to laptop for more rapid testing
…eld to previous state (without the flattening of the ghost before TS, we'll test that after).
…r the ghost cell (since we've gone back and forth between the two, and likely different operations will need one or the other in the future). Adapt unit test so it tests both options. Unit tests pass on CPU, not yet tested on GPU.
…pass on CPU and GPU. Unit test is compute-sanitizer clean.
…per core boundary. Now rt_gk_d3d_iwl_3x2v_p1 is valgrind clean.
…c app, where all core/sol ranges are created. Rename skin/ghost ranges in gyrokinetic and species apps to include local_ so they are consistent with their global counterpart. Checked the following regression tests and they all had unchanged results:

rt_gk_mirror_boltz_elc_1x2v_p1
rt_gk_multib_asdex_2x2v_p1
rt_gk_multib_step_2x2v_p1
rt_gk_neut_recycle_1x3v_p1
rt_gk_sheath_1x2v_p1
rt_gk_sheath_2x2v_p1
rt_gk_sheath_3x2v_p1
rt_gk_sheath_fluid_neut_1x2v_p1
rt_gk_tcv_iwl_adapt_source_2x2v_p1
rt_gk_tcv_iwl_adapt_source_3x2v_p1
@Antoinehoff
Collaborator

Antoinehoff commented Feb 10, 2026

Here are one-to-one comparisons of production-like simulations between the current state of main (5506bbe) and this branch (8fee1f5). The simulation setup is GK TCV IWL 3x2v (see rt_gk_tcv_iwl_adapt_source_3x2v_p1.c) at a coarse resolution (24x16x12x12x6), as presented in Hoffmann et al. 2026. The only parameter alterations are an increase of the input power, 0.5MW instead of ~0.25MW, to accelerate turbulence development, and a reduction of the collisionality scaling factor, $\nu_{frac}=0.5$ instead of $1.0$, to increase the time step.
The results from main and this branch are labelled main and poissonperp, respectively.

Edit: I updated the plots and analysis with longer runs, as a noticeable difference arises when approaching the quasi-steady state.

Note: The poisson perp run crashed at t~1320mus; the main run has not reached that time yet, so it is unclear if the crash is solely due to the solver. I've observed that reducing nu_frac can destabilize the simulation at longer times.

General numerics comments

The poissonperp run presents a very stable time step, ~4.6ns, against a fluctuating one for the main run, ~3 to 4.6ns. The poissonperp simulation is consequently faster, reaching ~400mus against ~330mus for main, in 6h on 4 GPUs.

Energy

It seems that poissonperp leads to a lower confined-energy state.
We see a strong reduction of the ion integrated Hamiltonian because of the global reduction of the potential value; this does not affect the total Hamiltonian. However, the electron Hamiltonian is lower in poissonperp than in main despite the more negative potential, which indicates that the electron thermal energy is strongly reduced in poissonperp.
main
image
poissonperp
image

Potential

Poisson perp saturates into lower potential values everywhere, increasing the ExB shear at the inner radial boundary and decreasing it at the outer one.
main
image
poissonperp
image

Density

Fairly similar, no comments.
main
image
poissonperp
image

Temperature

The electron quasi-steady-state temperature is markedly reduced with the Poisson perp solver, as expected from the energy analysis above. The SOL electron temperature matches the experiment better, but it is unclear for the core (recall that the initial condition of this simulation is ~ the experimental profile measurement). For the ions, a stronger temperature "well" is observed in the Poisson perp simulation.
main
image
poissonperp
image

Density fluctuations

We see that the fluctuations at the inner radial boundary are reduced in the poisson perp case, which may be due to the increased ExB shear in that region (the potential well is larger in poisson perp).
main
ne_fluct_main__n_frames_400_to_499
poissonperp
ne_fluct_poiperp__n_frames_400_to_499

Temperature fluctuations

It seems that the y-flows are stronger in the main simulation.
main
TeTi_fluct_main__Te_frames_400_to_499
poissonperp
TeTi_fluct_poiperp__Te_frames_400_to_499

Collaborator

@Antoinehoff Antoinehoff left a comment


Thanks for this great work!!

Collaborator

@Antoinehoff Antoinehoff left a comment


Longer runs show significant discrepancies in the quasi-steady state; whose result is right? The analysis above has been updated.

@manauref
Collaborator Author

Longer runs are a bit worrying: everything looks like it is losing energy. I think we need to see the steady state of this simulation. Also, I need help plotting the energy balance.

Can you edit the comment to add the equivalent plots from the simulation run in main? I can message you privately about the energy balance problem.

@Antoinehoff
Collaborator

Longer runs are a bit worrying: everything looks like it is losing energy. I think we need to see the steady state of this simulation. Also, I need help plotting the energy balance.

Can you edit the comment to add the equivalent plots from the simulation run in main? I can message you privately about the energy balance problem.

I just restarted it; it is not as advanced yet, but here is how the integrated moments have evolved so far. I'll update the previous comment tomorrow.

image

@manauref
Collaborator Author

@Antoinehoff one concern you had is that your energy conservation time trace looked worse in this branch, right? But the adaptive sources aim to conserve M2 energy, not H energy, right?

manauref added 14 commits March 15, 2026 16:18
…s of the LHS matrix on the GPU. The idea is to pack the nonzero values directly into the csr_val array in cudss_ops.cu, for which we'll need to pre-store the linear indices into this array. It'll take a bit of memory (exactly num_basis^2 * local_range.volume * sizeof(int)), but I couldn't think of a simpler way (gkylcas ae93c1e50ecfb936b4d5108866d3cd61b4b09771).
…_perp solvers so we can update the LHS matrix on the fly. Add a unit test of this, modifying existing tests to reduce code duplication. It passes the test on GPU. The CPU code is not ready; I see in superlu_ops.c that I tried to do this with the SuperLU solver in the past, but apparently it errored out, so I have to examine that again.
…so that we can update the LHS matrix without having to refactor (assuming the same sparsity pattern). Now that I understand SuperLU better, I see why it was erroring out when I first tried this a few years ago. We needed to use the expert driver (which the documentation doesn't make clear, but their source code gives hints of this). There may be some other memory savings/accelerations we can explore still, like not re-creating the B matrix, and setting the work memory only once. SuperLU and fem_poisson_perp unit tests pass, including new ones that update the A matrix.
…e are not destroying and creating the matrix every time. SuperLU and fem_poisson unit tests pass, but fem_poisson_perp occasionally hangs in the 2x periodic test, so I suspect there's a memory error somewhere. Unfortunately Perlmutter is down so I have no way of checking.
…ests would hang. It was just a bug in the unit test.
…once. @JunoRavin unlocked this via use of Opus. In particular it suggested a Glu array (which I understand now, looking at SuperLU's source code) and a particular calculation of the memory size needed (now in superlu_alloc_work_if_needed), whose origin I don't understand because the memory estimate I see in SuperLU's source code is different. But everything else I tried, including approaches following SuperLU docs and source-code comments, failed (gave erroneous answers, seg faulted, or was not valgrind clean), while this particular estimate of the memory needed works for all unit tests and is valgrind clean.
…g-fem_poisson_perp_bias

Merging "Update LHS matrix in fem_poisson_perp" into "Biasing in fem_poisson_perp"
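The commit notes above describe updating the LHS values in place while keeping the sparsity pattern fixed, using pre-stored linear indices into the CSR value array. A minimal sketch of that pattern in Python/SciPy (`build_index_map` is a hypothetical stand-in for the pre-stored index array, not the cudss_ops.cu or superlu_ops.c code):

```python
import numpy as np
import scipy.sparse as sp

def build_index_map(A_csr, entries):
    """For each (row, col) in `entries`, find its linear index into
    A_csr.data; fails if the entry is outside the sparsity pattern."""
    idx = []
    for r, c in entries:
        start, end = A_csr.indptr[r], A_csr.indptr[r + 1]
        k = start + int(np.searchsorted(A_csr.indices[start:end], c))
        assert k < end and A_csr.indices[k] == c, "not in sparsity pattern"
        idx.append(k)
    return np.asarray(idx)

A = sp.csr_matrix(np.array([[2.0, -1.0,  0.0],
                            [-1.0, 2.0, -1.0],
                            [ 0.0, -1.0, 2.0]]))
idx = build_index_map(A, [(0, 0), (1, 1), (2, 2)])

# Later re-assembly: overwrite values in place, no matrix rebuild needed.
A.data[idx] = 4.0
```

Precomputing the index map trades memory for speed, which is the same trade-off the commit message describes for the GPU path (num_basis^2 * local_range.volume integers).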