1. Introduction
Wave-induced diffusion is an important tool for plasma confinement, allowing for both current drive (Fisch Reference Fisch1978; Fisch & Boozer Reference Fisch and Boozer1980; Fisch Reference Fisch1987) and heating (Adam Reference Adam1987; Thumm et al. Reference Thumm, Denisov, Sakamoto and Tran2019) of fusion plasmas. As a result of these wave–particle interactions, the particle orbit changes, leading to wave-induced transport across magnetic field lines and flux surfaces. This cross-field transport is fundamentally linked to momentum exchange between the particle and the wave (Lee et al. Reference Lee, Parra, Parker and Bonoli2012; Guan Reference Guan2013; Guan et al. Reference Guan, Dodin, Qin, Liu and Fisch2013a , Reference Guan, Qin, Liu and Fischb ; Ochs & Fisch, Reference Ochs and Fisch2021b , Reference Ochs and Fisch2022; Ochs Reference Ochs2024). This combination of energetic and cross-field transport can drive instabilities, but can also enable beneficial effects such as alpha channelling, which aims to extract high-energy fusion-born ash while harvesting its energy for use in useful waves (Fisch & Rax Reference Fisch and Rax1992a , Reference Fisch and Raxb ; Valeo & Fisch Reference Valeo and Fisch1994; Herrmann & Fisch Reference Herrmann and Fisch1997). Correctly modelling these processes requires a quasilinear diffusion theory that respects momentum conservation.
Historically, there have been two classes of approaches for modelling resonant quasilinear wave–particle interactions. In perhaps the most elegant approach, one begins with a global electromagnetic field Fourier decomposed in action-angle coordinates, and then performs the quasilinear average in these coordinates (Kaufman Reference Kaufman1972; Eriksson & Helander Reference Eriksson and Helander1994; Herrmann Reference Herrmann1998; Brizard & Chan Reference Brizard and Chan2022). Because of the simple particle dynamics in these coordinates, the resulting expressions are fairly compact. However, the Fourier decomposition of even a simple wave field can be quite complex, and thus this approach is not necessarily ideal for all situations, such as modern simulations where the background fields (and thus the mapping to the action-angle coordinates) can change, or those which couple to simulations of the wave field.
In the second approach, one calculates the quasilinear diffusion tensor locally according to the local wavevectors, making use of the uniform-field Kennel–Engelmann diffusion tensor form (Kennel & Engelmann Reference Kennel and Engelmann1966; Lerche Reference Lerche1968). Then, one bounce averages this diffusion tensor over the particle trajectory. The resulting diffusion model is (perhaps surprisingly) robust to the presence of magnetic field gradients and curvature near the resonance (Stix Reference Stix1992, § 17-2), although it fails to accurately model cases where the cyclotron resonance occurs at an orbit turning point, or where the wave is resonant with the bounce frequency rather than the cyclotron frequency. Nevertheless, the model is sufficient for many important fusion-relevant processes, and has been the standard approach to Fokker–Planck calculations of wave–particle interactions in mirror (Bernstein & Baxter Reference Bernstein and Baxter1981; Matsuda & Stewart Reference Matsuda and Stewart1986; Frank et al. Reference Frank2025) and tokamak (Harvey & McCoy Reference Harvey and McCoy1992; Petrov & Harvey Reference Petrov and Harvey2016) plasmas, due to its relative simplicity and computational efficiency.
In order for the latter approach to accurately model wave-induced transport, one must start with a diffusion tensor that respects momentum conservation. Unfortunately, in its conventional form, the Kennel–Engelmann tensor does not, as it satisfies only two of the four requisite momentum-conserving relations.
To proceed, we first review the momentum conservation relations that must be satisfied during a resonant quasilinear wave–particle interaction, reviewing how these result in cross-field diffusion in a magnetised plasma. We also show that these imply a specific form for the diffusion tensor in energy–momentum space. We then review the Kennel–Engelmann diffusion tensor, showing how it respects a subset of the energy–momentum conservation relations. Using our understanding the required form of the diffusion tensor, we then trivially construct an extended Kennel–Engelmann tensor that fully respects energy and momentum conservation. Finally, we show how this new form of the tensor affects the form of constants-of-motion space quasilinear diffusion, in particular recovering the diffusion paths from the action-angle quasilinear theory. Since it relies on the same underlying bounce-averaged quasilinear theory, this extended Kennel–Engelmann tensor should be able to be rapidly adopted into existing Fokker–Planck codes.
2. Generic form of local diffusion tensor for a resonant wave–particle interaction
Consider a particle, possibly in the presence of a uniform magnetic field. Such a particle has a constant kinetic energy
$K$
and canonical momentum components
$\boldsymbol{p}$
, which are often combined in the energy–momentum four-vector
$p^\mu \equiv (K,\boldsymbol{p})$
.
To this system, add a monochromatic wave (oscillating electromagnetic field) with frequency
$\omega$
and wavevector
$\boldsymbol{k}$
. These objects can also be combined into a wave four-vector
$k^\mu \equiv (\omega , k^x, k^y, k^z)$
. Then, allow this wave to resonantly interact with the plasma, either through a Landau or gyro resonance.
It is well known (Trubnikov Reference Trubnikov and Leontovich1979; Stix Reference Stix1992; Dodin & Fisch Reference Dodin and Fisch2012; Guan et al. Reference Guan, Dodin, Qin, Liu and Fisch2013a ) that, during the resonant wave–particle interaction, the increments of energy and momentum satisfy the relation
This relationship follows from the conservation of the wave action
$\mathcal{I}$
, the definition of the Minkowski four-momentum
$\mathcal{P}^\mu \equiv k^\mu \mathcal{I}$
and the fact that the Minkowski subsystem and resonant particle subsystem form a conserving system (Ochs & Fisch Reference Ochs and Fisch2021b
, Reference Ochs and Fisch2022; Ochs Reference Ochs2024).
In a magnetised system, the relation in (2.1) leads to coupled gyrocentre-energy diffusion, as the change in the gyrocentre position is given by
where
$\boldsymbol{B}$
is the magnetic field. In other words, diffusion takes place along single line in gyrocentre-energy space – which is the basis for the alpha channelling effect (Fisch & Rax Reference Fisch and Rax1992a
,Reference Fisch and Rax
b
). Although the early references focused on diffusion due to high-frequency electrostatic waves, in fact the relation holds very generally for any electromagnetic wave. For instance, figure 1 shows diffusion in energy and gyrocentre space for a particle undergoing diffusion due to interaction with a left-polarised electromagnetic wave at the second cyclotron harmonic of a
$\hat {z}$
directed magnetic field. It can be seen that over many successive interactions with the wave, the particle stays on the line defined by (2.1)–(2.2). (The simulation is described in more detail in Appendix A.)

Figure 1. Simulation of ion interaction with a second-harmonic left-hand-polarised wave over many orbits. The particle diffuses in both normalised energy
$\bar {K}$
and gyrocentre position
$(\bar {X},\bar {Y})$
(points, with time corresponding to colour), but stays on the line determined by (2.1) (solid line). The normalised variables and details of the simulation can be found in Appendix A.
The relation in (2.1) means that the diffusion operator in four-momentum space must take a very specific form. Generally, the tensor describing one-dimensional diffusion along a vector
$V^\mu$
satisfies
$D^{\mu \nu } \propto V^\mu V^\nu$
. Since (2.1) shows that wave-induced diffusion occurs along a vector in
$p^\mu$
space with components proportional to
$k^\mu$
, we immediately find
Here,
$D^{KK}$
is the on-diagonal kinetic-energy component of the tensor. Equation (2.3) is very powerful, because it allows us to construct a four-momentum-conserving wave–particle interaction tensor from a single known tensor component.
3. Restoring conservation to the Kennel–Engelmann diffusion tensor
Quasilinear diffusion in due to a monochromatic wave in a uniform magnetic field is traditionally modelled by the equation (Kennel & Engelmann Reference Kennel and Engelmann1966; Lerche Reference Lerche1968; Harvey & McCoy Reference Harvey and McCoy1992; Stix Reference Stix1992; Petrov & Harvey Reference Petrov and Harvey2016)
where the volume element
$\sqrt {g} = 2\pi p_\perp$
, and
$\boldsymbol{D}$
is given by the Kennel–Engelmann diffusion tensor, which can be written
Here,
where
and
where
$E_i$
is the ith component of the oscillating electric field. Here the coordinates have been defined such that
$\boldsymbol{B} \parallel \hat {z}$
, and
$\theta$
is the angle defined by
(Note: the additional phase
$i\theta$
is not included in the definition of
$E^\pm$
in references, such as those for the CQL3D code (Harvey & McCoy Reference Harvey and McCoy1992; Petrov & Harvey Reference Petrov and Harvey2016), which employ the ‘Stix frame’, where
$\boldsymbol{k}$
lies in the
$x-z$
plane and thus
$\theta = 0$
.)
Equation (3.1) describes a diffusion process in two coordinates: the parallel momentum
$p_\parallel \equiv p_z$
, and the perpendicular momentum
$p_\perp \equiv m v_\perp$
. It is important to note that the perpendicular momentum so defined is more precisely an energy variable, having nothing to do with the gyrocentre position. Living in this reduced space, the Kennel–Engelmann diffusion tensor fundamentally cannot capture the full set of conservation relations demanded by (2.1).
Nevertheless, the Kennel–Engelmann tensor does capture the energy–momentum relation in the dimensions that it does model. To see this, we can transform the tensor from coordinates
$(p_\perp ,p_\parallel )$
to coordinates
$(K,p_\parallel )$
. To do this, it is sufficient to transform the vector
$\boldsymbol{w}$
(3.4). Using
we find
Comparing with (3.4), we see that this satisfies
implying (via (3.2)) that the diffusion satisfies the requirement in (2.3) for
$K$
and
$p_\parallel$
. Indeed, the connection of the Kennel–Engelmann diffusion path to the absorption of photon four-momentum has long been appreciated (Kennel & Engelmann Reference Kennel and Engelmann1966; Stix Reference Stix1992).
To make a momentum-conserving tensor is now trivial. Noting that
with
$D_0$
given by (3.3), (2.3) then tells us that in four-momentum space
$(K,p^x, p^y,p^z)$
The projection of this tensor to
$(K,p_\parallel )$
is precisely the Kennel–Engelmann tensor, while the new components capture the absorption of perpendicular momentum.
4. Constants-of-motion space diffusion
A typical application of quasilinear diffusion theory is to bounce-averaged Fokker–Planck theory. Such theories describe diffusion in the space of particle orbits, which can be specified by the constants of motion (COM) of the orbit. It is thus instructive to examine the effects of the new terms in the diffusion tensor on the particle diffusion in COM space.
A typical choice of the COM in an axisymmetric system is
$(\epsilon ,\mu ,p_\phi )$
, where
Here,
$\boldsymbol{r} \equiv (r,\phi ,z)$
is the cylindrical coordinate vector,
$\psi (\boldsymbol{r})$
is a position-dependent potential energy and
$A_\phi (\boldsymbol{r})$
is the
$\phi$
-component of the vector potential. The
$p_\phi$
component is closely related to the particle flux surface, and is often taken to define the flux surface. In these coordinates, the diffusion tensor again has the form from (3.2), and the task is to calculate the diffusion path vector components
$w^\epsilon$
,
$w^\mu$
and
$w^{p_\phi }$
.
It is clear that
$w^\epsilon$
and
$w^\mu$
are unchanged by the new momentum-conserving terms in the diffusion tensor. However,
$w^{p_\phi }$
is certainly changed, as
which involves a sum over the
$\phi$
-projection of all three momentum components, rather than just the parallel momentum. In a linear-confinement axisymmetric system such as a magnetic mirror, where
$\partial p_\phi /\partial p_\parallel = 0$
, the difference to the traditional Kennel–Engelmann theory is particularly striking, as the new components make the difference between non-existent cross-flux-surface transport, versus cross-flux-surface transport due to perpendicular momentum absorption.
Furthermore, we can see easily that the new terms make good sense. Plugging in
$w^{p_i} = k^i v_\perp / \omega$
(see e.g. (2.3), (3.2) and (3.11)) to (4.4), we find
the natural relation between the absorbed photon energy and angular momentum. Thus, we see that the change in the diffusion tensor modifies the momentum absorption so as to enforce angular momentum conservation.
For completeness, we can now fully derive the form of the diffusion tensor in COM space. It is clear that
$w^\epsilon = w^K$
. Equation (4.5) can be expressed in more familiar form by making use of the azimuthal mode number of the wave
$n_\phi$
, from whence
Then the final task is to find
$w^\mu$
. Noting that
and making use of the
$\delta$
-function in (3.3), we find
Thus, in
$(\epsilon ,\mu ,p_\phi )$
space, the diffusion tensor is given by (3.2), but with
\begin{align} \boldsymbol{w} &= v_\perp \begin{pmatrix} 1 \\[5pt] \dfrac{n \varOmega }{|B| \omega } \\[12pt] \dfrac{n_\phi }{\omega } \end{pmatrix}. \end{align}
This has the same form of the diffusion path found in the action-angle space quasilinear theory (see e.g. Herrmann (Reference Herrmann1998, (4.67)), but now calculated with a diffusion coefficient from the conventional bounce-averaged theory.
5. Discussion
In this paper, we have shown how to extend the Kennel–Engelmann diffusion tensor to respect quasilinear four-momentum conservation. By converting the resulting tensor to axisymmetric COM space, we have shown that the extended Kennel–Engelmann tensor modifies the cross-field wave-induced transport, recovering the diffusion path from the action-angle coordinate Hamiltonian theory (Herrmann & Fisch Reference Herrmann and Fisch1997; Herrmann Reference Herrmann1998). Thus, past wave transport calculations that did not rely on bounce averaging the two-dimensional diffusion tensor are unaffected by the new form of the diffusion tensor; this category includes alpha channelling theories derived by reconstructing diffusion paths in gyrocentre-energy space (Fisch & Rax Reference Fisch and Rax1992a
,Reference Fisch and Rax
b
; Valeo & Fisch Reference Valeo and Fisch1994; Ochs et al. Reference Ochs, Bertelli and Fisch2015a
,
Reference Ochs, Bertelli and Fischb
; Romanelli & Cardinali Reference Romanelli and Cardinali2020), or derived using action-angle quasilinear theory in toroidal coordinates (Kaufman Reference Kaufman1972; Eriksson & Helander Reference Eriksson and Helander1994; Herrmann & Fisch Reference Herrmann and Fisch1997; Herrmann Reference Herrmann1998). Instead, the current work allows the correction of calculations of wave-induced cross-field transport in calculations and codes that rely on bounce averaging the two-dimensional Kennel–Engelmann diffusion operator, such as CQL3D (Harvey & McCoy Reference Harvey and McCoy1992; Petrov & Harvey Reference Petrov and Harvey2016). Fortunately, the formulation here allows such codes to be quickly updated by adding a few terms to the existing operators. Such improvements are important in any geometry, but are likely to be particularly critical for transport in magnetic mirrors, where radial transport is driven exclusively by the absorption of perpendicular momentum. In updating these codes, it should be noted that the fact that diffusion occurs along a one-dimensional path with projections along
$\epsilon$
,
$\mu$
and
$\phi$
means that the diffusion operator in these coordinates must be intrinsically three-dimensional, with gradients along all three dimensions determining the fluxes. As a result, the diffusion cannot be cleanly separated into separate operators for within-flux-surface and across-flux-surface diffusion. The exception occurs when
$n_\phi$
is either negligibly small or very large, in which case the diffusion occurs either largely within or across the flux surface, respectively. These limiting cases, and their associated waves, are explored at length in Herrmann (Reference Herrmann1998).
It should be emphasised that the extended Kennel–Engelmann diffusion tensor derived here only describes the action of the wave on resonant particles. To get the total cross-field transport resulting from the wave, one must include non-resonant effects such as the ponderomotive recoil (Ochs & Fisch Reference Ochs and Fisch2021a , Reference Ochs and Fisch2023), which can also drive cross-field transport. In general, this distinction is only an issue for time-dependent wave envelopes; if the wave envelope is stationary, then the resonant diffusion represents the total cross-field transport due to the wave (Ochs & Fisch Reference Ochs and Fisch2021b , Reference Ochs and Fisch2022; Ochs Reference Ochs2024).
Finally, we note here that demanding the tensor form in (2.3) has the potential to ease the calculation of other resonant diffusion tensors, since the entire tensor can be constructed from a single component. Consider, for instance, the non-Bessel-function form of the magnetised susceptibility (Qin, Phillips & Davidson Reference Qin, Phillips and Davidson2007), which should allow for asymptotic expansion of the quasilinear diffusion tensor in the large-gyroradius (large-
$z$
) limit. Rather than performing the asymptotic expansion on a two-dimensional diffusion tensor, the approach taken here allows one to expand a single component, and then end up with a four-dimensional momentum-conserving tensor that includes the induced cross-field transport. Thus, leveraging the energy and momentum conservation allows one to produce a more powerful object with much less algebra.
Acknowledgements
The author would like to thank N. Fisch for useful discussions.
Editor Cary Forest thanks the referees for their advice in evaluating this article.
Funding
This work was partially supported by Department of Energy Grant No. DE-SC0016072, and partially by Princeton University.
Declaration of interests
The author reports no conflicts of interest.
Appendix A. Description of simulations
The simulations in the main text use a second-order modification to the Boris algorithm (Zenitani & Umeda Reference Zenitani and Umeda2018) to solve the Lorentz force equations in normalised form
Here,
$t$
is normalised to
$\varOmega _0^{-1}$
, and the electric field is normalised to
$E_0 = B_0 v_0 /c$
, where
$v_0$
is an arbitrarily normalised velocity.
We take a system with a primarily
$\hat {z}$
-directed uniform background magnetic field, with a small variation along
$\hat {z}$
For a left-hand-polarised wave, we take the vector potential
from whence
Here,
$\bar {z}_m = \mod (z + L/2,L) - L/2$
. The length
$a$
is taken to be much shorter than the system periodicity
$L$
, so that the wave essentially vanishes at the system ends (figure 2). The phase
$\bar {\phi }$
is generally constant, but is randomised once per orbit as the particle passes the point furthest from the wave envelope.

Figure 2. Amplitude of normalised electric field and axial magnetic field in the simulation. The resonant interaction occurs around
$z_m = 0$
; the measurements of gyrocentre and energy are taken around
$z_m = L/2$
.
The logic of the above system is as follows. The magnetic field adopts its mean value twice per length
$L$
: once at
$z_m = 0$
, and once at
$z_m = L/2$
. At
$z_m = 0$
, we place the wave field, which is resonant exactly at
$z_m = 0$
(as we take
$\bar {k}_z = 0$
and
$\bar {\omega } \in \mathbb{Z}$
). Then, we cleanly measure the gyrocentre position and energy at
$z_m = L/2$
, far from the interfering wave field. The phase randomisation preserves the random phase approximation inherent in the quasilinear theory, preventing coherent structure formation due to resonances between the bounce and wave motion.
The code outputs the gyrocentre position
$\boldsymbol{\bar {X}}$
, and the parallel and perpendicular velocities
$\bar {v}_\parallel$
and
$\bar {v}_\perp$
in the gyrocentre drift frame. The dimensionless energy is given by
$\bar {K} = {1}/{2} (\bar {v}_\perp ^2 + \bar {v}_\parallel ^2 )$
. The gyrocentre is related to the canonical momentum by
Combining (A7) with (2.1), the diffusion should occur on the line
This is the line shown in figure 1.
For the simulations shown in figure 1, the parameters are:
$\Delta = 0.07$
,
$L = 2000$
,
$a = 50$
,
$\bar {E}_{w0} = 0.0015$
,
$\bar {\omega } = 2$
,
$\bar {k}_x = 1$
,
$\bar {k}_y = 0.5$
. The particle was initialised with
$\bar {v}_\perp = \bar {v}_\parallel = 1$
, and followed for 50 transits of the system.




