Hostname: page-component-7dd5485656-hw7sx Total loading time: 0 Render date: 2025-10-23T16:19:05.498Z Has data issue: false hasContentIssue false

On hydromagnetic wave interactions in collisionless, high-β plasmas

Published online by Cambridge University Press:  19 January 2024

S. Majeski*
Affiliation:
Department of Astrophysical Sciences, Princeton University, Peyton Hall, Princeton, NJ 08544, USA Princeton Plasma Physics Laboratory, PO Box 451, Princeton, NJ 08543, USA
M.W. Kunz
Affiliation:
Department of Astrophysical Sciences, Princeton University, Peyton Hall, Princeton, NJ 08544, USA Princeton Plasma Physics Laboratory, PO Box 451, Princeton, NJ 08543, USA
*
Email address for correspondence: smajeski@princeton.edu
Rights & Permissions [Opens in a new window]

Abstract

We describe the interaction of parallel-propagating Alfvén waves with ion-acoustic waves and other Alfvén waves, in magnetized, high-$\beta$ collisionless plasmas. This is accomplished through a combination of analytical theory and numerical fluid simulations of the Chew–Goldberger–Low (CGL) magnetohydrodynamic (MHD) equations closed by Landau-fluid heat fluxes. An asymptotic ordering is employed to simplify the CGL-MHD equations and derive solutions for the deformation of an Alfvén wave that results from its interaction with the pressure anisotropy generated either by an ion-acoustic wave or another, larger-amplitude Alfvén wave. The difference in time scales of acoustic and Alfvénic fluctuations at high-$\beta$ means that interactions that are local in wavenumber space yield little modification to either mode within the time it takes the acoustic wave to Landau damp away. Instead, order-unity changes in the amplitude of Alfvénic fluctuations can result after interacting with frequency-matched acoustic waves. Additionally, we show that the propagation speed of an Alfvén-wave packet in an otherwise homogeneous background is a function of its self-generated pressure anisotropy. This allows for the eventual interaction of separate co-propagating Alfvén-wave packets of differing amplitudes. The results of the CGL-MHD simulations agree well with these predictions, suggesting that theoretical models relying on the interaction of these modes should be reconsidered in certain astrophysical environments. Applications of these results to weak Alfvénic turbulence and to the interaction between the compressive and Alfvénic cascades in strong, collisionless turbulence are also discussed.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press

1. Introduction

Until recently, our understanding of Alfvén waves (AWs) at scales much larger than plasma-kinetic scales has been largely agnostic of the rate of Coulomb collisions. Indeed, linear shear AWs and nonlinear torsional AWs do not change the form of the velocity distribution function of the particles, but rather define the moving frame in which any changes to it are to be measured (e.g. Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009). As a result, it has been predicted that many defining characteristics of Alfvénic turbulence, such as its spectral indices or spatial anisotropies, are at most subtly affected by a lack of collisions. In introducing the concept of AW interruption however, Squire, Quataert & Schekochihin (Reference Squire, Quataert and Schekochihin2016) and Squire, Schekochihin & Quataert (Reference Squire, Schekochihin and Quataert2017b) established that shear AWs in collisionless, high-$\beta$ plasmas are susceptible to non-equilibrium effects that place constraints on their ability to propagate freely, complicating the Alfvénic dynamics in new and novel ways.Footnote 1 This is just one example of how even small deviations from local thermodynamic equilibrium (LTE), made possible by low collisionality, can have large dynamical consequences for the evolution of high-$\beta$ systems. As a result, there has been increased interest in revisiting many problems in collisionless, high-$\beta$ plasmas that are fundamentally connected to Alfvénic dynamics, such as turbulence (Bott et al. Reference Bott, Arzamasskiy, Kunz, Quataert and Squire2021; Arzamasskiy et al. Reference Arzamasskiy, Kunz, Squire, Quataert and Schekochihin2023; Squire et al. Reference Squire, Kunz, Arzamasskiy, Johnston, Quataert and Schekochihin2023), the fluctuation dynamo (Santos-Lima et al. Reference Santos-Lima, de Gouveia Dal Pino, Kowal, Falceta-Gonçalves, Lazarian and Nakwacki2014; Rincon, Schekochihin & Cowley Reference Rincon, Schekochihin and Cowley2015; St-Onge & Kunz Reference St-Onge and Kunz2018; St-Onge et al. Reference St-Onge, Kunz, Squire and Schekochihin2020; Sironi, Comisso & Golant Reference Sironi, Comisso and Golant2023; Zhou et al. Reference Zhou, Zhdankin, Kunz, Loureiro and Uzdensky2024) and magnetic reconnection (Cassak et al. Reference Cassak, Baylor, Fermo, Beidler, Shay, Swisdak, Drake and Karimabadi2015; Alt & Kunz Reference Alt and Kunz2019; Winarto & Kunz Reference Winarto and Kunz2022; Egedal et al. Reference Egedal, Gurram, Greess, Daughton and Lê2023).

In well-magnetized and collisionless plasmas, the importance of these non-equilibrium effects can be parametrized by the product $\beta \varDelta$, where $\varDelta \doteq p_\perp /p_\parallel -1$ is the normalized difference between thermal pressures across ($\perp$) and along ($\parallel$) the local magnetic-field direction. Aside from determining whether a plasma is unstable to various pressure-anisotropy-driven microinstabilities, the quantity $\beta \varDelta$ also quantifies the competition between the plasma's pressure anisotropy and the magnetic tension, with the characteristic propagation speed of magnetic disturbances being the effective Alfvén speed $v_{{\rm A},{\rm eff}} \doteq v_{{\rm A}}\sqrt {1+\beta \varDelta /2}$. When $\beta \varDelta \sim \pm 1$, the restoring tension force can be significantly enhanced or even entirely eliminated, rendering Alfvénic fluctuations impotent. If $\beta \gtrsim 10$, then such circumstances are rather easy to achieve by even small deviations from LTE.

The collisionless high-$\beta$ plasmas in which these effects appear are in fact pervasive throughout the universe. Indeed, from radiatively inefficient accretion flows onto supermassive black holes, to the hot and dilute intracluster medium of galaxy clusters, to certain regions of the solar wind at and beyond the Earth's orbit, it is often the case that Coulomb collisions are rare and magnetic fields, while strong enough to magnetize particles, are energetically weak (Kunz et al. Reference Kunz, Squire, Balbus, Bale, Chen, Churazov, Cowley, Forest, Gammie and Quataert2019). Further, turbulence is a ubiquitous phenomenon within these systems, and it plays a crucial role in processes like plasma heating (e.g. Sharma et al. Reference Sharma, Quataert, Hammett and Stone2007; Kunz et al. Reference Kunz, Abel, Klein and Schekochihin2018; Kawazura, Barnes & Schekochihin Reference Kawazura, Barnes and Schekochihin2019; Arzamasskiy et al. Reference Arzamasskiy, Kunz, Squire, Quataert and Schekochihin2023) and angular-momentum transport (e.g. Sharma et al. Reference Sharma, Hammett, Quataert and Stone2006; Kunz, Stone & Quataert Reference Kunz, Stone and Quataert2016; Bacchini et al. Reference Bacchini, Arzamasskiy, Zhdankin, Werner, Begelman and Uzdensky2022; Sandoval et al. Reference Sandoval, Riquelme, Spitkovsky and Bacchini2023). To describe these processes within high-$\beta$ astrophysical environments accurately, some refinement of our understanding of their turbulent fluctuations is necessary. Therefore, with a focus on the role of $\beta \varDelta$, this work revisits the following three tenets of Alfvénic turbulence that are known to hold in magnetohydrodynamics (MHD) and even $\beta \sim 1$ collisionless gyrokinetics: (i) that co-propagating AWs do not interact with one another; (ii) that weak interactions between AWs do not modify their field-aligned wavenumbers; and (iii) that compressive fluctuations in the inertial range are only passively mixed by the Alfvénic cascade.Footnote 2

The assertion that co-propagating AWs do not interact with each other in Alfvénic turbulence (point (i)) is a simple result of $v_{\rm A}$ being relatively insensitive to nonlinearities in $\beta \sim 1$ plasmas. This effect arises naturally from the MHD equations and governs wave interactions across both weak and strong turbulence. Specific to weak AW interactions however (point (ii)), one can effectively apply energy and momentum conservation to pairs of interacting waves since they remain correlated for many linear times. Those constraints, combined with the fact that all Alfvénic fluctuations propagate at $v_{\rm A}$, prevent the generation of a cascade in $k_\|$. Lastly, for point (iii), inertial-range compressive fluctuations being passively mixed by AWs results from the lack of overlap between the linearized eigenvectors of acoustic and Alfvénic fluctuations in $\beta \sim 1$ plasmas when $k_\perp \gg k_\|$. None of the perturbed quantities of the acoustic waves enter to linear order in the propagation of AWs. However, among other things, AWs distort the direction of the magnetic field along which acoustic waves linearly perturb the thermal pressure. As such, they are sensitive to the presence of Alfvénic fluctuations. Yet, all of these phenomena may be expected to change in collisionless high-$\beta$ plasmas.

To understand the need for revisiting these wave interactions, consider the example of AW interruption (Squire et al. Reference Squire, Quataert and Schekochihin2016, Reference Squire, Schekochihin and Quataert2017b). In the interruption process, a transverse perturbation ($\delta B_\perp$) to the magnetic field begins to oscillate at the Alfvén frequency and initially decreases in amplitude. As a result, the field strength $|B|\sim B_0 + \delta B_\perp ^2/2B_0$ decreases nonlinearly. Conservation of the double adiabats $p_\perp /\rho B$ and $p_\parallel B^2/\rho ^3$ then dictates that negative pressure anisotropy $\varDelta <0$ is produced. If the amplitude is large enough ($\delta B_\perp /B_0 \gtrsim 2\beta ^{-1/2}$), a pressure anisotropy of $\varDelta \sim -2\beta ^{-1}$ can be produced by the changing $|B|$, although limited to $\varDelta = -2\beta ^{-1}$ by the Alfvénic firehose instability threshold. However, the rate of anisotropy production is Alfvénic, while $p_{\perp /\|}$ are diffused by heat fluxes operating on sonic time scales ($\tau _{\rm sonic} \propto L /c_{\rm s} \ll L/v_{\rm A}$, where $c_{\rm s} \equiv \sqrt {(2p_\perp /3+p_\|/3)/\rho }$ is the sound speed). The anisotropy is then smoothed out by the heat fluxes and fills the domain with a nearly constant $\varDelta = -2\beta ^{-1}$. At this point, the magnetic tension that supports the AW is nullified and the fluctuation ceases propagation, hence the term ‘interruption’.Footnote 3 If an AW is capable of experiencing a dramatic change in its propagation due to its own nonlinearly generated anisotropy, then one would expect that it would similarly respond to the anisotropy generated by other fluctuations within a turbulent system. It is for this reason that we revisit wave–wave interactions within high-$\beta$ collisionless plasmas.

In this paper, we focus on the dynamical and thermodynamical interactions between two waves: either ion acoustic waves (IAWs) and AWs, or two co-propagating AWs. In § 2, we outline expectations for these interactions based off of analytical theory performed within the CGL-MHD framework with the help of simplified Landau-fluid heat fluxes (Chew, Goldberger & Low Reference Chew, Goldberger and Low1956; Snyder, Hammett & Dorland Reference Snyder, Hammett and Dorland1997). Following this, in § 3, we test these predictions using a finite-volume Riemann-based code that numerically solves the CGL-MHD equations using the Landau-fluid heat fluxes. Finally, § 4 lays out the implications our results have for both weak and strong turbulence, and details some of the barriers that remain to establish a complete theory of Alfvénic turbulence in high-$\beta$ collisionless plasmas.

2. Theory

The degrees of freedom associated with wave–wave interactions in a collisionless plasma are numerous, and so to focus our investigation, we make several simplifying assumptions. First, we take the electrons to be much colder than the ions, so that we may set their temperature $T_{\rm e} = 0$. Certain high-$\beta$ astrophysical plasmas are thought to produce these circumstances, such as radiatively inefficient accretion flows onto supermassive black holes (e.g. Quataert Reference Quataert2003). However, the results we present are easily generalizeable to non-zero electron temperature (see § 4). Second, our focus is on the dynamics that occurs on wavelengths much larger than the ion Larmor radius and at frequencies much smaller than the ion gyrofrequency. Accordingly, we employ a fluid model that accounts for the adiabatic production of pressure anisotropy, its spatial redistribution by field-aligned heat fluxes and its dynamical feedback on the plasma through the action of an anisotropic pressure, all while neglecting finite-Larmor-radius effects. These ‘CGL-MHD’ equations (Chew et al. Reference Chew, Goldberger and Low1956) are

(2.1a)\begin{gather} \frac{\partial \rho}{\partial t}+\boldsymbol{\nabla} \boldsymbol{\cdot}(\rho \boldsymbol{u})=0 , \end{gather}
(2.1b)\begin{gather}\rho\left(\frac{\partial \boldsymbol{u}}{\partial t}+\boldsymbol{u} \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u}\right)={-}\boldsymbol{\nabla}\left(p_{{\perp}}+\frac{B^2}{8{\rm \pi}}\right)+\boldsymbol{\nabla} \boldsymbol{\cdot}\left[\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}\left(p_\perp{-} p_\parallel{+} \frac{B^2}{4{\rm \pi}}\right)\right] , \end{gather}
(2.1c)\begin{gather}\frac{\partial \boldsymbol{B}}{\partial t}=\boldsymbol{\nabla} \boldsymbol{\times}(\boldsymbol{u} \boldsymbol{\times} \boldsymbol{B}), \end{gather}
(2.1d)\begin{gather}\frac{\partial p_{{\perp}}}{\partial t}+\boldsymbol{\nabla} \boldsymbol{\cdot}\left(p_{{\perp}} \boldsymbol{u}\right)+p_{{\perp}} \boldsymbol{\nabla} \boldsymbol{\cdot} \boldsymbol{u}+\boldsymbol{\nabla} \boldsymbol{\cdot}\left(q_{{\perp}} \hat{\boldsymbol{b}}\right)+q_{{\perp}} \boldsymbol{\nabla} \boldsymbol{\cdot} \hat{\boldsymbol{b}} =p_{{\perp}} \hat{\boldsymbol{b}} \hat{\boldsymbol{b}}\,\boldsymbol{:}\, \boldsymbol{\nabla} \boldsymbol{u} , \end{gather}
(2.1e)\begin{gather}\frac{\partial p_{\|}}{\partial t}+\boldsymbol{\nabla} \boldsymbol{\cdot}\left(p_{\|} \boldsymbol{u}\right)+\boldsymbol{\nabla} \boldsymbol{\cdot}\left(q_{\|} \hat{\boldsymbol{b}}\right)-2 q_{{\perp}} \boldsymbol{\nabla} \boldsymbol{\cdot} \hat{\boldsymbol{b}} ={-}2 p_{\|} \hat{\boldsymbol{b}}\hat{\boldsymbol{b}}\,\boldsymbol{:}\,\boldsymbol{\nabla}\boldsymbol{u}. \end{gather}

Our notation is standard: $\rho$ is the mass density, $\boldsymbol {u}$ is the bulk fluid velocity, $\boldsymbol {B}$ is the magnetic field, and $p_\perp$ and $p_\parallel$ are the components of the pressure tensor oriented perpendicular and parallel to the local magnetic-field direction $\hat {\boldsymbol {b}}=\boldsymbol {B}/B$. Note that (2.1a) and (2.1c) may be used in (2.1d) and (2.1e) to obtain

(2.2a)$$\begin{gather} p_\perp \frac{{\rm d} }{{\rm d} t} \ln \frac{p_\perp}{\rho B} ={-} \boldsymbol{\nabla}\boldsymbol{\cdot}(q_\perp\hat{\boldsymbol{b}}) - q_\perp\boldsymbol{\nabla}\boldsymbol{\cdot}\hat{\boldsymbol{b}} , \end{gather}$$
(2.2b)$$\begin{gather}p_\parallel \frac{{\rm d} }{{\rm d} t}\ln \frac{p_\parallel B^2}{\rho^3} ={-}\boldsymbol{\nabla}\boldsymbol{\cdot}(q_\parallel\hat{\boldsymbol{b}}) + 2q_\perp\boldsymbol{\nabla}\boldsymbol{\cdot}\hat{\boldsymbol{b}} , \end{gather}$$

where ${\rm d}/{\rm d} t\doteq \partial /\partial t + \boldsymbol {u}\boldsymbol {\cdot }\boldsymbol {\nabla }$ is the comoving time derivative. These equations express conservation of the first and second adiabatic invariants in the frame of the flow but for the conservative redistribution of $p_\perp$ and $p_\parallel$ by the field-aligned conductive flows of perpendicular and parallel heat, $q_\perp$ and $q_\parallel$, respectively. Our third assumption is that these heat fluxes are given by the ‘$3+1$ model’ of Snyder et al. (Reference Snyder, Hammett and Dorland1997), which is tailored to capture the linear collisionless damping rates of small-amplitude fluctuations:

(2.3a)\begin{gather} q_{{\perp}}={-}\frac{2 c_{{\rm s}\|}^2}{\sqrt{2 {\rm \pi}} c_{{\rm s} \|}\left|k_{\|}\right|}\left[\rho \boldsymbol{\nabla}_{\|} \left(\frac{p_{{\perp}}}{\rho}\right)-p_{{\perp}}\left(1-\frac{p_{{\perp}}}{p_{\|}}\right) \frac{\boldsymbol{\nabla}_{\|} B}{B}\right], \end{gather}
(2.3b)\begin{gather}q_{\|}={-}\frac{8 c_{{\rm s}\|}^2}{\sqrt{8 {\rm \pi}} c_{{\rm s} \|}\left|k_{\|}\right|} \rho \boldsymbol{\nabla}_{\|}\left(\frac{p_{\|}}{\rho}\right), \end{gather}

where $c_{{\rm s},\|}=\sqrt {p_\|/\rho }$ is the parallel sound speed. The parameter $|k_\parallel |$ is a characteristic field-aligned wavenumber for the fluctuations in $B$, $p_{\perp /\|}$, and $\rho$ (e.g. Sharma et al. Reference Sharma, Hammett, Quataert and Stone2006); it is a simplification of the magnitude of the magnetic-field-aligned gradient operator, which would otherwise be very costly to calculate exactly because it is defined with respect to the local magnetic field and involves global operations. Because these heat fluxes describe linear collisionless damping, inherent in their use is the assumption that any perturbations produced by the waves studied here are sufficiently small with respect to the background plasma that nonlinear effects, such as saturation of the collisionless damping, can be ignored. Accordingly, while their analytical simplicity is invaluable in closing the CGL-MHD system, we discuss in § 2.3 some circumstances in which these heat fluxes become inaccurate.

2.1. Formulation of the problem

Armed with (2.1), we now constrain the waves whose interactions are the focus of this paper. First, we consider only waves that propagate parallel to the background magnetic field $\boldsymbol {B}_0$, such that $k_\perp =0$. This choice removes the nonlinear Alfvénic interaction that normally drives a cascade to small scales, meaning that any interaction seen is solely a consequence of the effects of pressure anisotropy discussed in this work. Admittedly, many problems of interest involve fluctuations that possess $k_\perp /k_\parallel$ of order unity or even much greater, such as critically balanced Alfvénic turbulence (Goldreich & Sridhar Reference Goldreich and Sridhar1995). However, as we demonstrate in § 2.2, one of the most important aspects of these wave interactions concerns the time scales associated with each interacting wave. Because AWs and IAWs possess frequencies that are independent of $k_\perp$, this facet of their interaction is captured regardless of $k_\perp$ (further discussion of the $k_\perp =0$ assumption can be found within § 4). Second, we assume that no constant background pressure anisotropy $\varDelta _0$ is present in the plasma. A non-zero background pressure anisotropy may be included with little effect to the results (so long as the plasma remains stable) by replacing the Alfvén speed $v_{\rm A}\doteq B_0/\sqrt {4{\rm \pi} \rho _0}$, where $\rho _0$ is the (assumed uniform) background density, with an effective Alfvén speed $v_{{\rm A},{\rm eff}}\doteq v_{\rm A}\sqrt {1+\beta \varDelta _0/2}$.

Our chief interest in this paper is how an AW responds to the fluctuating pressure anisotropy that results from other waves – a channel of communication that is customarily ignored in studies of plasma turbulence. Therefore, to distil this physics from (2.1), we pursue an asymptotic ordering of the wave perturbations that isolates the Alfvénic response to an externally generated pressure anisotropy $\varDelta =\varDelta (t,\boldsymbol {r})$. First, the AWs subject to $\varDelta$ are taken to be small enough in amplitude that their own nonlinearities can be ignored on the time scales of interest here, viz. ${\delta \boldsymbol {B}_\perp /B_0\sim \boldsymbol {u}_\perp /v_{\rm A}\ll 1}$. With $k_\perp = 0$, the vectors $\delta \boldsymbol {B}_\perp$ and $\boldsymbol {u}_\perp$ can be oriented along any perpendicular coordinate without loss of generality, and so henceforth we refer only to $\delta B_\perp$ and $u_\perp$. Second, we take $\varDelta$ to be large enough that its effect on the Alfvénic dynamics cannot be neglected, i.e. $|\beta \varDelta | \sim 1$. This ordering allows the stress associated with the pressure anisotropy, $(p_\perp -p_\parallel )\hat {\boldsymbol {b}}\hat {\boldsymbol {b}}$, to be comparable to the Maxwell stress, $\boldsymbol {BB}/4{\rm \pi}$. Lastly, we allow for compressive fluctuations in the parallel velocity ($u_\|$) and density ($\delta \rho$). Because $k_\perp = 0$ implies $\delta B_\parallel =0$, these fluctuations are purely acoustic, propagating at a speed ${\sim }c_{\rm s}$ that is much faster than $v_{\rm A}$. These assertions combined, the resulting maximal ordering is given by

(2.4)\begin{equation} |\varDelta| \sim \frac{\delta B_\perp}{B_0} \sim \frac{u_\perp}{v_{\rm A}} \sim \frac{u_\parallel}{c_{\rm s}} \sim \frac{\delta\rho}{\rho_0} \sim \frac{1}{\beta}\doteq\epsilon\ll 1. \end{equation}

To complete the ordering, we also make assumptions regarding the time evolution of these quantities. In particular, we assume that the time derivatives of $\delta B_\perp$ and $u_\perp$ are Alfvénic ($\partial _t \sim k_\parallel v_{\rm A}$), while all other (compressive) quantities evolve on sonic time scales ($\partial _t \sim k_\parallel c_{\rm s}$).

Without yet discussing the production/evolution of $\varDelta$ (to be addressed in §§ 2.2 and 2.3), we may then derive the evolution equations for our ordered Alfvénic fluctuations. Applying (2.4) to the components of (2.1b) and (2.1c) that are perpendicular to the guide field, and retaining only the leading-order terms in $\epsilon$, we find that

(2.5a)\begin{gather} \frac{\partial \delta B_\perp}{\partial t} = B_0 \frac{\partial u_\perp}{\partial x}, \end{gather}
(2.5b)\begin{gather}\rho_0 \frac{\partial u_\perp}{\partial t} = \frac{B_0}{4{\rm \pi}} \frac{\partial B_\perp}{\partial x} + \frac{\beta}{2} B_0 \frac{\partial }{\partial x} \left(\delta B_\perp \varDelta \right) . \end{gather}

The transparency of these equations can be improved somewhat by rewriting them in terms of Elsässer variables $z^\pm = u_\perp \pm \delta B_\perp /\sqrt {4{\rm \pi} \rho _0}$ (Elsässer Reference Elsässer1950):

(2.6)\begin{equation} \frac{\partial z^\pm}{\partial t} \mp v_{\rm A} \frac{\partial z^\pm}{\partial x} = v_{\rm A} \frac{\beta}{4} \frac{\partial }{\partial x} \left[ (z^+{-} z^-) \varDelta \right] . \end{equation}

The left-hand side of these Elsässer equations describe forward ($-$) and backward ($+$) propagating AWs. The right-hand side provides the nonlinear interaction with another fluctuation that perturbs the pressure anisotropically. This anisotropy can arise from linear acoustic modes, or even other large-amplitude AWs that generate $\varDelta$ nonlinearly. We analyse both of these situations in §§ 2.2 and 2.3.

2.2. Interaction between ion-acoustic and Alfvén waves

In this section, we describe the interaction between monochromatic IAWs and AWs in an otherwise uniform plasma with density $\rho _0$ and pressure $p_0 = \beta B_0^2/8{\rm \pi}$. This demands an understanding of the evolution of the $\varDelta$ that appears on the right-hand side of (2.6). In our high-$\beta$ ordering (2.4), the IAW fluctuations are energetically dominant over the Alfvénic fluctuations and, given their compressive nature, are chiefly responsible for the pressure perturbations $\delta p_\|$ and $\delta p_\perp$. This can be seen by applying (2.4) to the compressive components of (2.1), recalling that the time derivatives of compressive quantities are ordered as $\partial _t \sim k c_{\rm s}$, and retaining only leading-order terms:

(2.7a)\begin{gather} \frac{\partial \delta \rho}{\partial t} + \rho_0 \frac{\partial u_\parallel}{\partial x} = 0 , \end{gather}
(2.7b)\begin{gather}\rho_0 \frac{\partial u_\parallel}{\partial t} ={-}\frac{\partial \delta p_\parallel}{\partial x} , \end{gather}
(2.7c)\begin{gather}\frac{1}{p_0} \frac{\partial \delta p_\perp}{\partial t} = \frac{1}{\rho_0} \frac{\partial \delta \rho}{\partial t} + \frac{\sqrt{2} c_{\rm s}}{\sqrt{\rm \pi}|k_\parallel|p_0} \frac{\partial }{\partial x} \left( \frac{\partial \delta p_\perp}{\partial x} - \frac{p_0}{\rho_0} \frac{\partial \delta\rho}{\partial x}\right), \end{gather}
(2.7d)\begin{gather}\frac{1}{p_0} \frac{\partial \delta p_\parallel}{\partial t} = \frac{3}{\rho_0} \frac{\partial \delta\rho}{\partial t} + \frac{\sqrt{8} c_{\rm s}}{\sqrt{\rm \pi}|k_\parallel| p_0} \frac{\partial }{\partial x} \left(\frac{\partial \delta p_\parallel}{\partial x} - \frac{p_0}{\rho_0} \frac{\partial \delta\rho}{\partial x} \right). \end{gather}

These equations describing ion-acoustic fluctuations are entirely linear, with no feedback from the Alfvénic fluctuations. Therefore, we may simply prescribe the pressure anisotropy in (2.6) being due to ion-acoustic fluctuations as

(2.8)\begin{equation} \varDelta(t,x) = \varDelta_0 \exp (\mathrm{i}k_{\rm C}x + \mathrm{i}\omega _{\rm C} t), \end{equation}

where $\omega _{\rm C} \approx (\pm 1.47 + 0.46\mathrm {i})k_{\rm C} c_{\rm s}$ is the (complex) linear frequency of a damped ion-acoustic wave.Footnote 4 Substituting this anisotropy (with $+\omega _{\rm C}$ to start) into the Elsässer equation (2.6) and Fourier transforming into $k$-space yields the following evolution equation for AWs at each wavenumber:

(2.9)\begin{equation} \frac{\partial z^\pm (k)}{\partial t} \mp {\rm i} kv_{\rm A} z^\pm (k) = {\rm i} kv_{\rm A} \frac{\beta\varDelta_0}{4} \, {\rm e}^{{\rm i}\omega _{\rm C} t} \left[ z^+(k-k_{\rm C}) - z^-(k-k_{\rm C}) \right]. \end{equation}

The right-hand side of (2.9) shows that both forward- and backward-propagating AWs are generated at different wavenumbers when an AW is subject to the pressure anisotropy of an acoustic wave. Although this interaction term is not precisely proportional to the level of imbalance $|z^+(k)|^2-|z^-(k)|^2$, no change to either $z^+$ or $z^-$ will result if their initial amplitudes are exactly equal.

Assuming only an initial $z^-$, which makes the acoustic and Alfvén waves counter-propagating, the dynamics at the initial Alfvén mode's wavenumber $k_{\rm A}$ is simply

(2.10)\begin{equation} \frac{\partial z^\pm (k_{\rm A})}{\partial t} \mp {\rm i} k_{\rm A} v_{\rm A} z^\pm (k_{\rm A}) = 0 \quad \Longrightarrow \quad z^- (k_{\rm A}) = z_0 \exp({-{\rm i} k_{\rm A}v_{\rm A}t});\quad z^+(k_{\rm A})=0. \end{equation}

The leading-order wave–wave interaction can be found at $k_{\rm A}+k_{\rm C}$ by evaluating (2.9) given (2.10):

(2.11)\begin{gather} \left.\begin{aligned} z^+(k_{\rm A}+k_{\rm C}) \approx z_0\frac{\beta\varDelta_0}{4} \left( \frac{\omega_{\rm A}+k_{\rm C}v_{\rm A}}{2\omega_{\rm A} \,{-}\, \omega _{\rm C}+k_{\rm C}v_{\rm A}} \right) \left[\exp({{\rm i}(\omega _{\rm C}\,{-}\,\omega_{\rm A})t}) \,{-}\, \exp({{\rm i} (k_{\rm A} +k_{\rm C})v_{\rm A} t})\right], \\ z^-(k_{\rm A}+k_{\rm C}) \approx z_0\frac{\beta\varDelta_0}{4} \left( \frac{\omega_{\rm A}+k_{\rm C}v_{\rm A}}{\omega _{\rm C} + k_{\rm C}v_{\rm A}} \right) \left[\exp({{\rm i}(\omega _{\rm C}-\omega_{\rm A})t}) \,{-}\, \exp({-{\rm i} (k_{\rm A} +k_{\rm C})v_{\rm A} t})\right] , \end{aligned}\!\right\} \end{gather}

where we have defined $\omega _{\rm A} \doteq k_{\rm A}v_{\rm A}$. Note that the choice $k_{\rm C} \sim k_{\rm A}$ implies $|\omega _{\rm C}| \gg \omega _{\rm A}$, and thus a weak interaction in high-$\beta$ plasmas. To obtain a stronger interaction, we focus on the amplitude of the reflected wave packet (for $\mathrm {Re}(\omega _{\rm C})>0$) and maximize the absolute value of the $z^+$ coefficient. Doing so indicates that the strongest interaction occurs when

(2.12)\begin{equation} \frac{k_{\rm C}}{k_{\rm A}} = \frac{2k_{\rm C}v_{\rm A}(k_{\rm C}v_{\rm A}+\omega_{{\rm C,r}})}{\omega^2_{{\rm C,r}}+\omega^2_{{\rm C,i}}-k^2_{\rm C}v^2_{\rm A}} \approx \frac{2}{\sqrt{\beta}}, \end{equation}

where the subscripts ‘${\rm r}$’ and ‘${\rm i}$’ denote the real and imaginary parts of $\omega _{\rm C}$. With ${\beta \gg 1}$, (2.12) makes the absolute value of the $z^-$ coefficient ${\approx }0.3z_0\beta \varDelta _0$. For $\beta \varDelta _0 \sim 1$, this is always an $\mathcal {O}(1)$ interaction, decreasing linearly with the amplitude of the acoustic mode. Perhaps unsurprisingly, the strongest interaction occurs when the frequencies are nearly matched, although this also means that there is effectively no change in wavenumber of the Alfvénic fluctuations, since $k_{\rm A}+k_{\rm C} \approx k_{\rm A} (1 + 2/\sqrt {\beta }) \sim k_{\rm A}$. In this sense, the strongest IAW–AW interaction resembles the frequency-matching condition of the parametric decay instability (Sagdeev & Galeev Reference Sagdeev and Galeev1969). Additionally, while the maximum $z^+$ coefficient occurs for $\omega _{{\rm C}, {\rm r}}>0$, the situation with $k_{\rm C} \sim k_{\rm A}/\sqrt {\beta }$ still yields an $\mathcal {O}(1)$ reflection if $\omega _{{\rm C}, {\rm r}} <0$ (i.e. if the waves are co-propagating).

To apply these conclusions to physical systems, we must also consider the case where $\varDelta$ is purely real. This demands that the Fourier expansion of $\varDelta$ contains both $+\textrm {i} k_{\rm C}$ and $-\textrm {i} k_{\rm C}$ exponentials, which in turn create source terms on the right-hand side of (2.9) from both $k-k_{\rm C}$ and $k+k_{\rm C}$. While this complicates the problem somewhat, (2.11) can still be used to obtain an order-of-magnitude estimate for the solution. The reason for this stems from the fact that the acoustic mode collisionlessly damps on the same time scale as it propagates. By the time an Alfvénic fluctuation at $k_{\rm A}+k_{\rm C}$ grows because of the interaction to a similar amplitude as that of the original wave at $k_{\rm A}$, $\varDelta$ has largely vanished. The subsequent interaction of the $k_{\rm A}+k_{\rm C}$ AW with the now nearly depleted $\varDelta$ to generate a $k_{\rm A}+2k_{\rm C}$ mode will be considerably weaker. As a result, the amplitude of the $k_{\rm A}+2k_{\rm C}$ fluctuation is likely to be small, and may not provide significant feedback to the evolution of the mode at $k_{\rm A}+k_{\rm C}$. This prediction is tested using numerical simulations in § 3.

The conservation of certain quantities by this interaction may also be of interest. In particular, it may appear jarring that there is a nonlinear mechanism in (2.6) by which new Alfvénic fluctuations can be generated, but no corresponding sink present in (2.7). This would seem to imply that energy is not conserved if $|z^\pm |^2$ increase but nothing happens to the acoustic fluctuations. In this case, the apparent mismatch stems from the fact that, if $u_\parallel /c_{\rm s} \sim u_\perp /v_{\rm A}$, then there is considerably more (${\sim } \beta \times$) energy present in the acoustic fluctuations than in the Alfvénic ones. As a result, any energy given up by the acoustic fluctuations is higher order to themselves, but leading order to an AW with comparable amplitude. This can be demonstrated by going to third order in the compressive equations and re-establishing conservation of energy, a calculation presented in detail in Appendix A.Footnote 5 Why energy goes to the Alfvénic from the acoustic fluctuations is not as obvious. The reason becomes apparent, however, when considering the same interaction without damping of the acoustic wave. With $\omega _{{\rm C},{\rm i}}=0$ in (2.11), the leading-order Alfvénic response would then have a purely periodic amplitude at each $k$. Essentially, the effects of a fluctuation with $\varDelta >0$ would be nearly wiped out by the subsequent equivalent negative anisotropy. Only the root mean square amplitude of the newly generated fluctuations would grow, and on a time scale much slower than the AW propagation. Instead, if the anisotropy decays, then the positive anisotropy generated during the first half of the acoustic period would be larger than the negative anisotropy generated during the second half-period, meaning that the difference in the nonlinear effects accumulates. Once the acoustic wave is decayed away, each $k$ of the Alfvénic spectrum is left with only a steady-state amplitude and energy.

2.3. Interaction between two AWs

The other source of pressure anisotropy we consider for (2.6) is another, larger-amplitude AW. In particular, we are interested in the ability of nonlinear AWs to generate pressure anisotropy, thereby modifying their own $v_{{\rm A},\textrm {eff}}$ in a manner that is dependent upon their amplitudes (similarly to the interruption process described in § 1; Squire et al. Reference Squire, Quataert and Schekochihin2016). We therefore choose to study the interaction between two co-propagating AW packets, which are initially isolated such that their self-modified $v_{{\rm A},\textrm {eff}}$ do not affect the other wavepacket. In this set-up, one of the wave packets will be large enough in amplitude such that it generates a pressure anisotropy $\varDelta \sim \beta ^{-1}$, thereby fulfilling the role of the IAW in § 2.2. This will be called the ‘primary’ wave packet (with relevant quantities denoted by the subscript ‘p’). We first calculate the pressure anisotropy generated by this primary wave packet and demonstrate that its $v_{{\rm A},\textrm {eff}}$ is increased as a result. Given sufficient time then, the two initially separated wave packets will be able to interact if the trailing wave packet is of larger amplitude. To study the subsequent interaction, we consider the leading (‘secondary’) wave packet to satisfy the Alfvénic components of (2.4), thereby allowing us to employ (2.6) in solving for its nonlinear deformation, while ignoring the back reaction on the primary AW packet.

The mechanism for producing pressure anisotropy differs considerably between IAWs and shear-Alfvén waves. While pressure anisotropy is produced at linear order for the former, the latter change neither the magnetic-field strength nor the plasma density to linear order in the fluctuation amplitude, and so by adiabatic invariance have no associated linear pressure anisotropy. Indeed, (2.2) without the heat fluxes giveFootnote 6

(2.13a)\begin{gather} \frac{{\rm d} }{{\rm d} t} \left(\frac{p_\perp}{\rho B}\right) = 0 \quad\Longrightarrow\quad \frac{{\rm d} p_\perp}{{\rm d} t} = \frac{p_\perp}{\rho}\frac{{\rm d} \rho}{{\rm d} t} + \frac{p_\perp}{B}\frac{{\rm d} B}{{\rm d} t} , \end{gather}
(2.13b)\begin{gather}\frac{{\rm d} }{{\rm d} t} \left(\frac{p_\parallel B^2}{\rho^3}\right) = 0 \quad\Longrightarrow\quad \frac{{\rm d} p_\parallel}{{\rm d} t} = \frac{3p_\parallel}{\rho}\frac{{\rm d} \rho}{{\rm d} t} - \frac{2p_\parallel}{B}\frac{{\rm d} B}{{\rm d} t} . \end{gather}

The magnetic-field strength changes only at second order in the fluctuation amplitude, according to $B = \sqrt {B_0^2 +\delta B_{\perp,{\rm p}}^2}$ (while $\delta \rho = 0$). From this, it is clear that $\delta B^2_{\perp,{\rm p}}/B^2_0$ must be ${\gtrsim }1/\beta$ for the stress associated with the induced pressure anisotropy to be comparable to the Maxwell stress (the usual restoring force in an AW; Squire et al. Reference Squire, Quataert and Schekochihin2016). In AW packets of this amplitude, the addition of a competitive pressure-anisotropic stress dictates that their initial perturbation will be unable to propagate rigidly at $v_{\rm A}$ as is the case when $\varDelta =0$. Instead, a strictly positive (owing to $\delta B_{\perp, {\rm p}}^2 >0$) nonlinear pressure anisotropy will be generated by the oscillating $\delta B_{\perp, {\rm p}}$ with a dominant wavenumber of $k=2k_{\rm A}$. This cannot, however, last in the presence of the heat fluxes (2.4). The pressure anisotropy $\varDelta _{\rm p}$ is generated by the AW packet as it propagates at a rate of $\sim kv_{\rm A}$, while it is diffused by the heat fluxes at a much faster rate of $\sim kc_{\rm s}$. As a result, $\varDelta _{\rm p}$ is smeared out across the wave packet as it propagates, inheriting the spatial structure of the wave packet envelope rather than the mode itself. The end result is that shear AW packets initialized with an amplitude $\delta B_\perp /B_0 \gtrsim \beta ^{-1/2}$ generate smooth, envelope-scale, positive pressure anisotropies. Accordingly, $v_{{\rm A},\textrm {eff}}>v_{\rm A}$ and such packets propagate faster than those with small (linear) amplitudes.Footnote 7

With a $\varDelta _{\rm p}$ profile resembling the envelope of the parent AW packet (rather than the oscillating wave within the envelope), there will be a peak in the magnitude of $\varDelta _{\rm p}$ at the centre of the packet, where $\delta B_{\perp, {\rm p}}^2$ is the largest. Through $v_{{\rm A},\textrm {eff}} = v_{\rm A}\sqrt {1+\beta \varDelta _{\rm p}(x)/2}$, this region of the packet will travel faster than the regions with lower $\delta B_{\perp, {\rm p}}^2$ that occur at the trailing edge of the envelope. In time, an isolated AW packet should therefore steepen to form a shock-like wavefront, which propagates at an enhanced $v_{{\rm A},\textrm {eff}}$ from the background.Footnote 8 It is this shocked, more rapidly propagating wave packet that can interact with smaller amplitude AW packets in front of it. In a collisionless plasma, however, the heat fluxes driven by a shock-like gradient in the pressure cannot be accurately described by (2.3). This complicates our effort to solve (2.1d) and (2.1e) for the $\varDelta _{\rm p}$ of the steepened packet. In this case, two alternative descriptions of $\boldsymbol {q}_{\perp /\|}$ are likely to be more accurate. For one, the heat fluxes may simply be that of free streaming particles. Unless reflected by the mirror force, plasma particles would transit the shock front and change their perpendicular temperature adiabatically according to $\delta T_{\perp,{\rm p}}/T_0 \approx \delta B_{\perp,{\rm p}}^2/2B_0^2$. Another possibility, which is likely for near-interruption AW packets in particular, is that the distribution of particle velocities resulting from an adiabatic response to $\delta B_{\perp,{\rm p}}$ would be unstable to kinetic microinstabilities. Larmor-scale magnetic fluctuations generated by the ion-acoustic or whistler heat-flux instabilities (Komarov et al. Reference Komarov, Schekochihin, Churazov and Spitkovsky2018; Roberg-Clark et al. Reference Roberg-Clark, Drake, Swisdak and Reynolds2018) could scatter particles in pitch angle, giving the heat fluxes a similar functional form as (2.3), but with a different effective diffusion rate. We choose to continue our analysis under this assumption of diffusive heat fluxes, both because it may apply in particular to near-interruption AW packets, and for the fact that our ‘$3+1$’ Landau-fluid heat fluxes (2.3) are also diffusive, allowing us to verify the results numerically for arbitrary diffusion coefficients by modifying $|k_\||$ (Appendix B). In this case, given that particle motions are thermal, the upstream plasma with its higher temperature is able to pass through the shock front and diffuse downstream, at a rate controlled by the diffusion coefficient. This leads to the development of a pressure anisotropy precursor, which stretches in front of the shock as it propagates. For a finite diffusion coefficient, this precursor decays away steadily as one moves further downstream of the shock. We derive this precursor analytically in Appendix B for a general diffusion coefficient $\kappa$, finding that the functional form of the pressure anisotropy is a decaying exponential in a frame moving with the shock.

What remains to be discussed is how the $\varDelta _{\rm p}$ precursor of a large AW packet affects smaller-amplitude AW packets that cannot outrun its increased propagation speed, $v_{{\rm A},\textrm {eff}}>v_{\rm A}$. As the tail of a secondary wave packet begins to feel the $\varDelta _{\rm p}$ of the larger-amplitude primary packet approaching from behind it, there will be a larger $v_{{\rm A},\textrm {eff}}$ at the back of the secondary wave packet than at the front. This will cause the secondary packet to compress, shortening its parallel wavelength. Unlike the IAW–AW interaction however, no backward propagating AW packet will be created. To see why, consider the evolution equation (2.6) for $z^+$ assuming only an initial $z^-$:

(2.14)\begin{equation} \frac{\mathrm{d} z^+}{\mathrm{d}t} \doteq \frac{\partial z^+}{\partial t} - v_{{\rm A},{\rm eff}}(x) \frac{\partial z^+}{\partial x} = v_{\rm A}\frac{\beta}{4} z^+ \frac{\partial \varDelta _{\rm p}}{\partial x} - v_{\rm A}\frac{\beta}{4} \frac{\partial}{\partial x} \left( z^- \varDelta _{\rm p} \right). \end{equation}

The final term on the right-hand side represents a possible source or sink, depending on its sign. However, for $\partial _x \varDelta _{\rm p} <0$, the first term on the right-hand side damps $z^+$ exponentially along the wave characteristics. Given that the $\varDelta$-precursor of a large-amplitude $z^-$ packet is monotonically decreasing in $x$, $z^+$ fluctuations are always damped in this interaction. This simplifies the equation for $z^-$, which can be written in terms of its energy density $\mathcal {E}^-(x) \doteq |z^-|^2$ by assuming $z^+=0$:

(2.15)\begin{equation} \frac{\partial }{\partial t} \ln{\mathcal{E}^-} + v_{\rm A} \left(1+\frac{\beta\varDelta _{\rm p}}{4} \right)\frac{\partial }{\partial x} \ln{\mathcal{E}^-} ={-}v_{\rm A}\frac{\beta}{2}\frac{\partial \varDelta _{\rm p}}{\partial x}. \end{equation}

Solving (2.15) using the method of characteristics yields

(2.16)\begin{equation} \mathcal{E}^-(x,t) = \mathcal{E}_0^-(x_0(t,x)) \exp \left\{{-}v_{\rm A}\frac{\beta}{2}\int\limits_0^t\,\mathrm{d}t'\left.\frac{\partial \varDelta _{\rm p}}{\partial x}\right|_{x(t',x_0)}\right\}, \end{equation}

where $x(t,x_0)$ is the trajectory of the $x_0$ characteristic determined by the solution of

(2.17)\begin{equation} \frac{{\rm d} x}{{\rm d} t} = v_{\rm A}\left(1+\frac{\beta\varDelta _{\rm p}(t,x)}{4} \right), \end{equation}

and $x_0(t,x)$ is the foot of the characteristic obtained by inverting $x(t,x_0)$. Equation (2.16) states that the tail of the smaller-amplitude AW packet will steepen according to ${d}_tx$, while growing exponentially along characteristics (and more rapidly so for steeper $\varDelta _{\rm p}$). Motivated by the derivation of the $\varDelta _{\rm p}$ precursor for diffusive heat fluxes in Appendix B, we consider an exponentially decaying anisotropy

(2.18)\begin{equation} \varDelta _{\rm p}(x) = \varDelta_{{\rm p}, 0}\exp [(x-v_{{\rm A},{\rm eff}}t)/l_{\varDelta}] , \end{equation}

which propagates rigidly at $v_{{\rm A},\textrm {eff}} = v_{\rm A}\sqrt {1+\beta \varDelta _{{\rm p}, 0}/2}$. Figure 1(a) shows the characteristics that such an anisotropy would generate if $\varDelta _{{\rm p},0} = \beta ^{-1}$ and $l=0.1L$, where $L$ is a characteristic domain length. The compression of characteristics indicates that the trailing end of the secondary mode develops a shorter wavelength over time. Figure 1(b) exhibits the gain in energy experienced by a Gaussian wave packet that is subjected to this $\varDelta _{\rm p}(x)$ after $t=L/v_{\rm A}$, for different values of the decay length $l_{\varDelta }/L$. As $l_{\varDelta }/L$ becomes smaller, the gain in energy of the mode begins to increase. In general, this trend of increasing energy gain will continue, but here the gain curve drops when $l_{\varDelta }/L$ becomes small enough that the packets becomes too well separated to feel the effect of $\varDelta$ within a time $t=L/v_{\rm A}$ (for verification of this, we show the gain curve for two identical secondary packets initially separated from the primary by $\delta x = 0.4L$ and $\delta x = 0.8L$). Without the interference of other waves in this duo, the secondary packet compression will likely continue until the anisotropy of the primary wave damps away, or the wavelength of the secondary packet reaches dissipation scales. In a scenario where the large-amplitude wave interacts with many smaller-amplitude fluctuations, the compression (and therefore energization) of the smaller modes should be considered in the damping of the larger-amplitude mode, likely enhancing the rate of decay.Footnote 9

Figure 1. (a) Characteristic curves for an AW subjected to an exponentially decaying pressure anisotropy (2.18) with characteristic decay length $l_\varDelta =0.1L$ and phase speed $v_{{\rm A},\textrm {eff}}=\sqrt {2}v_{\rm A}$. Characteristics originating near the left of the domain begin to converge by the time they reach the right side of the domain, indicating $\varDelta$-induced AW steepening. (b) Energy gain by the secondary AW packet after one Alfvén-crossing time for different $l_\varDelta$, demonstrating that a steeper $\varDelta$ profile leads to more rapid gain in energy by the secondary packet.

We do not describe in any detail the interaction between primary and secondary anti-propagating AWs; however, there is a notable asymmetry in the $\varDelta$-mediated interactions between co- and anti-propagating AWs. Anti-propagating interactions are considerably weaker, both because of the much shorter interaction time caused by the larger difference in propagation velocity, and because of the difficulty of establishing any sort of frequency matching with an exponential pressure anisotropy. With co-propagating AW packets, the primary packet modifies $v_{{\rm A},\textrm {eff}}$ nonlinearly, thus the difference in velocities between packets is proportional to $\delta B_\perp ^2/B_0^2$; in the anti-propagating case, the difference in velocities is ${\approx }2v_{\rm A}$. Furthermore, due to the monotonically decreasing $v_{{\rm A},\textrm {eff}}$ associated with the $\varDelta$-precursor of the primary packet, the primary packet carries the co-propagating secondary packet with it and the two can interact until microphysical effects interfere. Alternatively, for an anti-propagating interaction, if one were to calculate the $k$-space dependence of the primary packet's $\varDelta$, then the corresponding version of (2.9) could be solved, but instead with numerous $k_{\rm C}$ because $\varDelta$ would not be monochromatic. That said, unlike in the IAW–AW interactions, the fact that $\varDelta$ is not monochromatic makes it difficult to achieve any semblance of frequency matching, and thus the interaction cannot similarly be tuned to maximize its strength.

3. Numerical simulations

To test the predictions made in § 2, we perform fluid simulations of each wave–wave interaction within the CGL-MHD framework (2.1). Our numerical approach uses a new Riemann solver implemented in a version of the finite-volume simulation code Athena++ (Stone et al. Reference Stone, Gardiner, Teuben, Hawley and Simon2008) that evolves $p_\perp$ and $p_\parallel$ using CGL equations that are closed with Landau-fluid heat fluxes (2.3) (Squire et al. Reference Squire, Kunz, Arzamasskiy, Johnston, Quataert and Schekochihin2023). Unless otherwise stated, the field-aligned wavenumber $|k_\parallel |$ in the Landau-fluid heat fluxes is set equal to the initial dominant wavenumber of the primary pressure-anisotropy-generating fluctuation, viz. the IAW in the acoustic–Alfvén interaction and the larger-amplitude AW in the Alfvén–Alfvén interaction. All simulations are performed with $\beta =400$, with only linear perturbations being initialized for the fluid moments of each wave: $u_\perp$ and $\delta B_\perp$ for AWs; and $u_\parallel$, $\delta \rho$ and $\delta p_{\perp /\parallel }$ for acoustic waves. The interaction of acoustic and Alfvénic fluctuations is studied primarily using monochromatic plane waves, while Alfvén–Alfvén interactions are studied using wave packets to demonstrate how a larger-amplitude AW packet can ‘catch up’ with an initially separated, smaller-amplitude AW packet. An acoustic–Alfvén interaction simulation was also performed using wave packets to visually depict the generation of a reflected AW as well as the non-locality of the interaction with respect to wavenumber. All monochromatic plane-wave simulations use periodic boundary conditions, while all simulations involving wave packets make use of outflow boundary conditions (where the derivatives of all fluid fields are set to zero at the boundaries).Footnote 10 The only spatial coordinate is $x \in [0,1]$, aligned with the background magnetic field and measured in dimensionless Alfvénic units (where $v_{\rm A} = B_0/\sqrt {4{\rm \pi} \rho _0}=1$) such that the Alfvén crossing time of a domain of length $L=1$ is $L/v_{\rm A}=1$. AW packets are initialized according to $\delta B_\perp = \alpha B_0 \psi (x)\cos (kx)$ and $u_\perp = \pm \alpha v_{\rm A} \psi (x)\cos (kx)$, with amplitude $\alpha$ and Gaussian envelopes $\psi (x)$ having standard deviations of approximately $2{\rm \pi} /k$. For the AW–AW interaction, the primary packet amplitude is $\alpha = 4/\sqrt {\beta }$, while the secondary packet amplitude is $\alpha = 1/\beta$. In the IAW–AW interaction, the acoustic wave is initialized with amplitude $\delta u_\parallel /v_\textrm {th} = 2/3\beta$, and the Alfvén wave is initialized with $\delta B_\perp /B_0 = \delta u_\perp /v_{\rm A} = 1/2\beta$. Because strong IAW–AW interactions at high $\beta$ are very non-local in $k$ space, AW fluctuations with wavelengths of as little as one-hundredth of the domain length need to be resolved. Similarly, for an AW packet to remain well separated from the outflow boundaries but still propagate for several linear times, the wave packet envelope and its dominant wavelength must be much smaller than the domain length. To capture these high-$k$ fluctuations adequately, a resolution of 9216 cells is used for the AW–AW interactions and up to 12 288 cells for the IAW–AW interactions. Importantly, the amplitudes of all fluctuations are chosen such that their associated pressure anisotropies are within the bounds of the firehose and mirror instabilities, $-2 \lesssim \beta \varDelta \lesssim 1$ (Rudakov & Sagdeev Reference Rudakov and Sagdeev1958; Southwood & Kivelson Reference Southwood and Kivelson1993).

3.1. Interaction between ion-acoustic and Alfvén waves

To illustrate the problem being studied, we begin with a simple example of co-propagating, quasi-monochromatic acoustic- and Alfvén-wave packets. We set the dominant wavenumber of the acoustic wave to $k_{\rm C} = k_{\rm A}/\sqrt {\beta }$, such that the modes are nearly matched in their linear frequencies. Figure 2 shows the evolution of these two packets from the initial set-up until the point when the IAW has passed out of the domain, leaving behind only the modified AW packet(s). The top panel, showing the initial conditions, demonstrates that the initial pressure anisotropy associated with the IAW (black) dips as low as $\beta \varDelta \approx -2$, and does not overlap with the AW to any significant degree. As the acoustic wave propagates, it decays rapidly at a rate of ${\simeq }0.31\omega _{\rm C}$. Therefore, by the time it overlaps with the AW, the amplitude of its driven pressure anisotropy has decayed significantly (middle panel). Nonetheless, the modification of the local $v_{{\rm A},\textrm {eff}}$ can be seen in the tail of the AW as giving rise to a new, smaller-amplitude fluctuation. By the time the IAW has decayed away and propagated out of the simulation domain, two changes have occurred to the initial wave packet. Most obvious is that, as predicted by (2.11), a smaller-amplitude, backward-propagating AW packet of roughly equal wavelength to the parent AW packet has been generated. This process therefore appears to mimic some features of the parametric decay instability of AWs. It distinguishes itself from parametric decay, however, by the more subtle increase in the amplitude of the initial forward-propagating AW packet. Instead of reflecting a portion of the initial wave packet and decreasing its amplitude, the IAW spawns these modifications to the AW packet by giving up an infinitesimal (with respect to itself, not the AW) portion of its own energy. In the end, the degree of Alfvénic imbalance, normalized by the total energy of the AW fluctuations, decreases by roughly $5\,\%$.

Figure 2. Landau-fluid CGL-MHD simulation of the interaction between the pressure anisotropy driven by an IAW packet (black) and an AW packet (orange) with matched frequencies. The IAW packet collisionlessly damps rapidly from the initial conditions (top frame), before nonlinearly deforming the short-wavelength AW packet (middle). After the IAW has decayed and propagated rightwards out of the domain, a backward-propagating AW packet has developed, with no apparent change to the wavenumber of the parent AW.

The interaction between the wave packets in figure 2 is expected to peak when the frequencies of the IAW and AW are roughly matched. However, the exact amplitude of the reflected fluctuation will depend on the minutia of the initial conditions, such as the packets’ initial separation and the widths of their envelopes. We therefore move to the periodic plane-wave set-up used to derive (2.9), so that we can assess the accuracy of (2.11) with fewer variable factors. In doing so, we choose to initialize a standing IAW with zero initial pressure anisotropy (otherwise the intensity of the interaction would be hidden by an immediate Alfvénic response to the non-zero anisotropy at $t=0$). This is accomplished by perturbing $u_\parallel (x,t=0) = \alpha v_\textrm {th}\sin (k_{\rm C}x)$ alone, with an amplitude $\alpha =2/3\beta$ such that the corresponding ‘forced’ anisotropy in (2.6) would be given by $\varDelta (x) = \beta ^{-1}\sin (\omega _{{\rm C,r}}t)\cos (k_{\rm C}x) \exp (-\omega _{{\rm C,i}}t)$. We simulate the interaction for a range of $k_{\rm A}/k_{\rm C}$, all for an elapsed time of $t_{f}= 2{\rm \pi} \omega _{\rm A}^{-1}$. The energy $\mathcal {E}^-=|z^-|^2$ contained within all $z^-$ fluctuations at the end of each simulation is plotted versus $k_{\rm A}/k_{\rm C}$ in figure 3, compared with an analytic estimate based off of (2.11). The analytic estimate is simply obtained by accounting only for the initial AW within the source term on the right-hand side of (2.9). Separating the expression for the pressure anisotropy generated by the standing wave into its complex exponential parts, the approximation is given by

(3.1)\begin{align} \mathcal{E}^-(t) &\approx \tfrac{1}{8}\left[ \left|z^-(k_{\rm A}+k_{\rm C}, \omega_{{\rm C,r}}) - z^-(k_{\rm A}+k_{\rm C}, - \omega_{{\rm C,r}})\right|^2 \right.\nonumber\\ &\quad \left.+ \left|z^-(k_{\rm A}-k_{\rm C}, \omega_{{\rm C,r}}) - z^-(k_{\rm A}-k_{\rm C}, -\omega_{{\rm C,r}})\right|^2 \right], \end{align}

where the $\pm \omega _{\rm C}$ in parentheses indicates the direction of IAW propagation used when evaluating $z(k_{\rm A}+k_{\rm C})$. As expected, both the prediction and the analytic estimate peak in interaction strength when $k_{\rm A}/k_{\rm C} \sim \sqrt {\beta }/2\sim 10$ and agree rather well. The reason for this agreement can be found within figure 4(a), which shows the energy spectra of the forward- and backward-propagating Alfvénic fluctuations at $t=2{\rm \pi} (5\omega _{\rm A})^{-1}$ and $t=2{\rm \pi} (\omega _{\rm A})^{-1}$. In figure 4(a), the slow, diffusive $k$-space nature of the IAW–AW interaction can be seen. With the initial AW represented by the peak in $\mathcal {E}^-(k)$, each Fourier component of the energy is several orders of magnitude smaller than the initial $k_{\rm A}$, with the sole exception being $k_{\rm A} \pm k_{\rm C}$ (denoted by two vertical dotted lines). Furthermore, the difference between $\mathcal {E}(k_{\rm A}\pm k_{\rm C})$ at $t=2{\rm \pi} (5 \omega _{\rm A})^{-1}$ and $t=2{\rm \pi} (\omega _{\rm A})^{-1}$ is much smaller than that for the higher/smaller $k$. Indeed, it is possible to show that when energy cascades in both directions and $k_{\rm C}/k_{\rm A}$ is infinitesimal, (2.9) exhibits diffusive-like behaviour in $k$-space, with a diffusion rate that is proportional to $k_{\rm C}^2$. Therefore, our estimate (3.1) is supported by this observation of $k_{\rm A}\pm k_{\rm C}$ being the dominant newly generated wavenumbers within a single Alfvén time. The $\mathcal {E}^\pm$ additionally evolve nearly identically, demonstrated by the relatively small change in cross-helicity with respect to the total energy shown in figure 4(b). This is expected, given that the form of (2.9) suggests this interaction relates to the degree of imbalance at each wavenumber and appears to effectively damp it over time.

Figure 3. Energy contained within backward-propagating AWs after one Alfvén time of interaction between a forward-propagating monochromatic AW and a standing monochromatic IAW, determined numerically as a function of the ratio of the Alfvén and acoustic wavenumbers using Landau-fluid simulations (red pluses). An approximate analytical solution for the energy (3.1) is shown as the solid line. Overall good agreement between theory and simulation is found, demonstrating strong interaction when the wave frequencies are approximately matched.

Figure 4. (a) Energy spectrum of forward- and backward-propagating Alfvénic fluctuations during interaction with a standing IAW. (b) Total change in wave energy (orange) and imbalance (black), normalized to the initial Alfvénic fluctuation energy $\mathcal {E}_0$, versus time. In panel (a), the IAW–AW interaction primarily generates new AW fluctuations at $k_{\rm A}\pm k_{\rm C}$, exhibited by the steepness of the spectra outside of the interval $k \in [k_{\rm A}-k_{\rm C}, k_{\rm A}+k_{\rm C}]$. This explains the accuracy of (3.1), as fluctuations at $k_{\rm A} \pm 2k_{\rm C}$ are too weak to affect the solution dramatically. In panel (b), the change in energy imbalance is several orders of magnitude smaller than the increase in total energy of the Alfvénic fluctuations, demonstrating that the IAW–AW interaction decreases imbalance with respect to overall AW energy.

3.2. Interaction between two AWs

The interaction of co-propagating AWs relies upon the differing propagation speeds that result when different amplitudes generate different local pressure anisotropies. Accordingly, all simulations presented in this section will be of initially isolated wave packets with outflow boundary conditions. Both wave packets are initialized with a dominant wavenumber of $k_{\rm A} = 80{\rm \pi}$, with amplitudes of $\delta B_z/B_0 = -\delta u_z/v_{\rm A} = 4/\sqrt {\beta }$ and $\delta B_y/B_0 = -\delta u_y/v_{\rm A} = \beta ^{-1}$ for the large-amplitude and small-amplitude waves, respectively. The magnetic perturbations have been assigned to different directions only for the purpose of visualizing them separately; this choice is not necessary for the interaction itself. The standard deviations of the Gaussian wave envelopes are $2{\rm \pi} /k_{\rm A}$ for both packets, the centres of which are initially separated by a distance of $\Delta x = 0.11$.

Before examining the interaction between these two wave packets, we first verify our predictions about the evolution of the primary wave packet. Figure 5(a) displays the primary AW packet after it has steepened to form a shock. As reported by Squire et al. (Reference Squire, Quataert and Schekochihin2016), the waveform (orange) has become square-like to minimize the change in $\delta B_{\perp, {\rm p}}^2$. Due to the rapid action of the heat fluxes, the pressure anisotropy (black) has assumed a shape similar to that of the wave envelope, with an additional precursor that extends in front of the magnetic perturbation. A negative dip in $\beta \varDelta$ is present behind the tail of the wave packet, although it does not propagate with the packet; it actually results from the magnetic perturbation departing from its initial location, causing a net decrease in $|B|$ at the location where it was initialized. This localized dip is akin to the negative anisotropy generated during the more conventional interruption process of a monochromatic AW (Squire et al. Reference Squire, Kunz, Quataert and Schekochihin2017a), as we did not initialize the wave packet with the pressure anisotropy that it generates shortly thereafter. Interruption does not occur here because the $\beta \varDelta$ dip is not sufficiently negative to nullify the magnetic tension. Figure 5(b) demonstrates the relationship between the perpendicular temperature and the perturbation to the magnetic-field strength. After an initial adjustment into the near-steady-state shocked AW packet, $\delta T_\perp$ closely follows the peak value of $\delta |B| = \delta B_{\perp,{\rm p}}^2$ at the shock front. This supports our prediction made in Appendix B that the maximum value of the pressure anisotropy can be calculated via conservation of the double adiabats.

Figure 5. (a) Magnetic field and pressure anisotropy profiles of the steepened AW packet, and (b) relationship between the perpendicular pressure and the magnetic perturbation amplitude at the front of the shock. The steepened wave packet is led by a shock in the magnetic-field profile, with a smoothed pressure anisotropy profile and $\varDelta$ precursor modifying the local Alfvén speed. The dip in $\varDelta$ behind the wave packet results from the initial conditions of the AW not including $\varDelta$, hence the magnetic-field strength decreases in this region as the packet propagates away. As this packet propagates, the approximate conservation of the $T_\perp /B$ adiabat sets the amplitude of the decaying anisotropy, and thus $v_{{\rm A},\textrm {eff}}$, at the shock front.

In § 2.3, we predicted that the pressure-anisotropy-enhanced propagation speed of large-amplitude (primary) AW packets would allow them to ‘catch up’ to and then distort smaller-amplitude (secondary) AW packets. Evidence of this physics can be found in figure 6, which shows how the pressure anisotropy generated by the primary (black) interacts with the magnetic-field perturbation of the secondary (orange).

Figure 6. Time slices of a Landau-fluid simulation of the the AW–AW packet interaction. The pressure anisotropy of the primary packet is evolved alongside the magnetic perturbation of the secondary packet. The packets are initially separated (top), yet over time, the enhanced effective speed allow the primary packet to approach the secondary from behind (middle). By the end of the simulation, the effective wavenumber of the secondary packet has more than doubled from the steepening induced by the primary packet's $\varDelta$ precursor.

In the top panel, shortly after the magnetic disturbance of the large-amplitude mode begins to propagate, there is a short-wavelength $\varDelta$. This pressure anisotropy is smoothed out rapidly by the action of heat fluxes, however, and leads to a smoothly increasing positive anisotropy perturbation whose front propagates at a slightly enhanced $v_{{\rm A},\textrm {eff}}>v_{\rm A}$. The middle panel at $k_{\rm A} v_{\rm A}t=9.9$ illustrates the moment that this front catches up to the tail end of the smaller-amplitude AW; the overlap between the two fluctuations causes this tail to propagate slightly faster, thereby compressing the waveform of the secondary. As this process continues into the final frame, the small-amplitude wave packet has had its width reduced by roughly half, and its amplitude increased due to the growth of $\mathcal {E}^-(x)$ (see (2.16)). Because the difference in Alfvén speeds between these two packets is relatively small, the cumulative deformation occurs slowly with respect to the Alfvén time of one of the packets, making this a weak interaction even though it leads to such a dramatic change in the structure of the wave packet envelope. This compression is expected to continue until the anisotropy of the large-amplitude packet has nonlinearly damped to be sufficiently small, or it is deformed by interaction with other wave packets.

4. Discussion: applications in turbulence

The analysis presented in §§ 2 and 3 suggests that modern theories of turbulence in high-$\beta$ collisionless plasmas must account for fundamentally different wave–wave interactions. In weak MHD turbulence, energy and momentum conservation dictate that AWs are unable to modify their $k_\|$ through interaction (Montgomery & Turner Reference Montgomery and Turner1981; Galtier et al. Reference Galtier, Nazarenko, Newell and Pouquet2000). Yet, we have just demonstrated a weak interaction of two AWs with differing amplitudes where the parallel wavelength does in fact change significantly. In a similar vein, the compressive cascade of MHD turbulence relies on the fact that slow modes possess a linear time scale that never exceeds the Alfvénic time scale of the turbulence that mixes them. Further, these slow modes have no effect on the Alfvénic cascade and effectively decouple from it below the forcing scale (Lithwick & Goldreich Reference Lithwick and Goldreich2001; Cho & Lazarian Reference Cho and Lazarian2003; Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009; Nazarenko & Schekochihin Reference Nazarenko and Schekochihin2011; Howes & Nielson Reference Howes and Nielson2013). However, when slow modes are replaced by collisionless IAWs, these roles nearly reverse, with acoustic modes propagating too fast to be mixed effectively by the AWs, and in some cases, even demonstrating an ability to reflect Alfvénic fluctuations.

Still, certain aspects of these waves that may be important in a turbulence setting are not considered in this study. In particular, because we focused solely on wave amplitudes for which the induced pressure anisotropies satisfy $|\varDelta |\lesssim 1/\beta$, we do not consider the effects of the firehose or mirror instabilities. Previous work has shown that AWs, IAWs, and other compressive fluctuations evolve quite differently if their self-generated pressure anisotropies are large enough in magnitude to excite these Larmor-scale instabilities (Squire et al. Reference Squire, Kunz, Quataert and Schekochihin2017a; Kunz et al. Reference Kunz, Squire, Schekochihin and Quataert2020; Majeski, Kunz & Squire Reference Majeski, Kunz and Squire2023). As demonstrated from first principles by Arzamasskiy et al. (Reference Arzamasskiy, Kunz, Squire, Quataert and Schekochihin2023), the anomalous scattering that results from these instabilities is likely to influence the entire turbulent cascade, yielding yet another source of non-local energy transfer.

As mentioned in § 2, this study was performed assuming cold electrons. However, generalizing to electrons with finite temperature is quite straightforward, as long as the electrons remain barotropic. With an electron pressure tensor that satisfies $\boldsymbol{\mathsf{p}}_{{\rm e}} = p_{{\rm e}} \boldsymbol{\mathsf{I}} = (\rho T_{\rm e} /m_{\rm i})\boldsymbol{\mathsf{I}}$, no additional pressure anisotropy is produced by either wave, and so AW propagation is entirely unaffected. The only modification is to the ratio of the real and complex parts of the IAW frequency $\omega _{{\rm C,i}}/\omega _{{\rm C,r}}$, which, unless the electrons partake in Landau damping of the acoustic mode, decreases with increasing $T_{\rm e}$. This would bolster the effectiveness of the IAW–AW interaction by extending the lifetime of the pressure anisotropy and allowing the interaction to occur over more than one linear time of the frequency-matched modes.

In § 2.1, we noted the lack of $k_\perp$ as a key difference between these pressure-anisotropy-induced interactions and those found within standard Alfvénic turbulence. The nature of the pressure-anisotropic stress we highlight is significantly different from the Reynolds and Maxwell stresses, which dominate energy transfer in Alfvénic turbulence (Grete et al. Reference Grete, O'Shea, Beckwith, Schmidt and Christlieb2017). Most importantly, $\beta \varDelta$, being a scalar, will always affect $v_{{\rm A},\textrm {eff}}$ regardless of the ratio $k_\perp /k_\|$, doing so through the same nonlinearity that we retain in (2.6). Therefore, the effects of IAWs or AWs with $\beta \varDelta \sim 1$ are expected to remain leading order when $k_\perp \neq 0$. Furthermore, the linear time scales of IAWs and AWs are independent of $k_\perp$, thus the frequency-matching criterion of their interaction, and its spectral non-locality, are unlikely to change.

Given the further complications of microinstabilities and other high-$\beta$ effects such as magneto-immutability (Squire et al. Reference Squire, Kunz, Arzamasskiy, Johnston, Quataert and Schekochihin2023), formulating an analytical model for turbulence in this parameter regime remains a challenging task. The intention of this work is primarily to hint at the physics underlying changes in the observed behaviour of collisionless turbulence at $\beta \gg 1$, such as non-local energy transfer due to the pressure-anisotropic stress (Arzamasskiy et al. Reference Arzamasskiy, Kunz, Squire, Quataert and Schekochihin2023). To that end, we may still speculate upon some of the more qualitative consequences of the wave–wave interactions described herein. One of the most effective tools for describing strong turbulence is an appropriately reduced set of equations. For example, reduced MHD incorporates the concept of critical balance directly by ordering the linear frequency comparable to the nonlinear frequency, $\tau _\textrm {lin}^{-1} \doteq k_\parallel v_{\rm A} \sim \tau _\textrm {nl}^{-1} \approx k_\perp u_\perp$ (Goldreich & Sridhar Reference Goldreich and Sridhar1995; Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009). Reinstating non-zero $k_\perp$ then, we can consider the reduced Alfvénic Elsässer equations that result from our 1-D high-$\beta$ ordering (2.4), modified to include $k_\|/k_\perp \sim \delta B_\parallel /B_0 \sim \epsilon$:

(4.1)\begin{align} &\frac{\partial \boldsymbol{z}^\pm_\perp}{\partial t} \mp v_{\rm A} \boldsymbol{\nabla}_\parallel \boldsymbol{z}^\pm_\perp{+} \boldsymbol{z}^\mp_\perp \boldsymbol{\cdot}\boldsymbol{\nabla}_\perp \boldsymbol{z}^\pm_\perp{+} \boldsymbol{\nabla}_\perp P_{\rm total} \nonumber\\ &\quad = v_{\rm A}\frac{\beta}{4}\boldsymbol{\nabla}_\parallel\left[(\boldsymbol{z}_\perp^+{-}\boldsymbol{z}_\perp^-)\varDelta\right] + \frac{\beta}{4}(\boldsymbol{z}_\perp^+{-}\boldsymbol{z}_\perp^-)\boldsymbol{\cdot}{\boldsymbol{\nabla}}_\perp\left[(\boldsymbol{z}_\perp^+{-}\boldsymbol{z}_\perp^-)\varDelta\right], \end{align}

where $P_\textrm {total}$ represents the combined perpendicular and magnetic pressures. Contained within the right-hand side of (4.1) are the various new effects discussed throughout this work that are absent in a collisional MHD model, or even a $\beta \sim \mathcal {O}(1)$ gyrokinetic model. The first term on the right-hand side incorporates the ability of AWs to travel at different $v_{{\rm A},\textrm {eff}}$, while both terms describe the deformation in $\boldsymbol {z}^\pm$ that results from wave–wave interactions. Equation (4.1) does not distinguish between the source of $\beta \varDelta$; be it generated by AWs with $\delta B_\perp /B_0 \sim \beta ^{-1/2}$ or IAWs with $\delta \rho /\rho \sim \beta ^{-1}$, the Alfvénic cascade will be modified. In each case, however, the description of the source of fluctuations can be more difficult. For example, IAWs act non-locally to affect Alfvénic fluctuations in $k$-space, but the reverse may not be true for Alfvénic mixing of acoustic waves. In that case, there is not necessarily a single acoustic Elsässer equation that can be written at each scale, because multiple $\boldsymbol {k}_\perp$ and $k_\parallel$ are inherent to the problem. Furthermore, in a collisionless turbulent cascade, $\beta \varDelta$ cannot remain $\mathcal {O}(1)$ at every wavenumber throughout the inertial range. As all of the other quantities cascade, $\beta$ will remain constant, so it is likely that interactions mediated by $\varDelta$ are a leading-order effect only in a specific portion of the inertial range where $\beta \varDelta \sim 1$. One subtle conclusion of the above reduced equations is that pressure balance in these plasmas should be struck primarily between $B^2$ and the perpendicular pressure $p_\perp$, rather than the isotropic pressure (Squire et al. Reference Squire, Kunz, Arzamasskiy, Johnston, Quataert and Schekochihin2023). The leading order of (4.1) implies that $\delta p_\perp /p_0 \sim \beta ^{-1}\delta B_\parallel /B_0 \sim \epsilon ^2$. In essence, the anisotropy generated should be dominated by the parallel pressure perturbation. This was found empirically in simulations of incompressibly driven turbulence using the same CGL-MHD Athena++ code employed here (Squire et al. Reference Squire, Kunz, Arzamasskiy, Johnston, Quataert and Schekochihin2023), as well as in the hybrid-kinetic simulations of Arzamasskiy et al. (Reference Arzamasskiy, Kunz, Squire, Quataert and Schekochihin2023).

Additional effects may be uncovered by studying the weak turbulence resulting from these wave–wave interactions. Isolated wave packets are not typically considered within strong turbulence; however, the long deformation times associated with weak turbulence allows waves to remain correlated over many local Alfvén-crossing times (Iroshnikov Reference Iroshnikov1964; Kraichnan Reference Kraichnan1968). This also expands upon the range of scales over which $\beta \varDelta$ exerts an influence by not requiring that the interaction remain strong. One effect that may be impactful to weak turbulence is the ability of AWs to modify each others’ $k_\parallel$. Current understandings of weak AW interactions involve resonance conditions describing three-wave interactions (Galtier et al. Reference Galtier, Nazarenko, Newell and Pouquet2000):

(4.2)\begin{gather} \boldsymbol{k}_1 + \boldsymbol{k}_2 = \boldsymbol{k}_3 \quad\Longrightarrow\quad k_{\parallel1}+k_{\parallel2}=k_{\parallel3} \end{gather}
(4.3)\begin{gather}\omega(\boldsymbol{k}_1^\pm) + \omega(\boldsymbol{k}_2^\mp) = \omega(\boldsymbol{k}_3^\pm) \quad\Longrightarrow\quad k_{\parallel1}-k_{\parallel2}=k_{\parallel3}, \end{gather}

which demand that $k_{\parallel 2}=0$ and $k_{\parallel 1} = k_{\parallel 3}$. However, these resonance conditions assume that $v_{\rm A}(k_\parallel ) = v_{\rm A}$, which is subject to modification if $v_{{\rm A},\textrm {eff}}$ is a function of the fluctuation amplitude, and hence the scale $k_\parallel$ of concern. In fact, for $\beta \varDelta \ll 1$, the frequency resonance condition would become

(4.4)\begin{equation} v_{\rm A}k_{{\parallel} 1}\left[1+ \frac{\beta}{4} \varDelta(k_{{\parallel} 1})\right] - v_{\rm A}k_{{\parallel} 2}\left[1 + \frac{\beta}{4} \varDelta(k_{{\parallel} 2})\right] \approx v_{\rm A}k_{{\parallel} 3} \left[1+\frac{\beta}{4} \varDelta(k_{{\parallel} 3})\right]. \end{equation}

Unlike in weak MHD Alfvénic turbulence, a $k_{\parallel 2} = 0$ mode is no longer a solution of the above condition by virtue of $\varDelta (k_{\parallel 2})$ increasing with decreasing $k_{\parallel 2}$ (assuming a cascade to higher wavenumbers for $\varDelta$). This holds regardless of the additional fact that co-propagating interactions can occur, which have been shown to change $k_\parallel$ as well. Together, these effects may be capable of modifying the onset of critical balance, changing the transition scale between weak and strong turbulence. Well-crafted simulations of Alfvénic and compressive turbulence in both collisionless and weakly collisional high-$\beta$ plasmas are the clear next step.

Acknowledgements

The authors thank Jonathan Squire, Alexander Schekochihin, Eliot Quataert and the participants of the 14th Plasma Kinetics Working Meeting at the Wolfgang Pauli Institute in Vienna for useful discussions, as well as the two anonymous reviewers for comments that helped sharpen the presentation.

Editor Alex Schekochihin thanks the referees for their advice in evaluating this article.

Funding

S.M. and M.W.K. were supported in part by NSF CAREER Award No. 1944972. High-performance computing resources were provided by the PICSciE-OIT TIGRESS High Performance Computing Center and Visualization Laboratory at Princeton University.

Declaration of interests

The authors report no conflict of interest.

Appendix A. Energy conservation in IAW–AW interactions

Without going to higher order in $\epsilon$ when deriving an equation for the evolution of the ion acoustic wave, the solution (2.11) describing the interaction of AWs with ion acoustic waves does not conserve the total energy. The reasoning for this is that, while the effect of the acoustic wave's pressure anisotropy on the AW is leading order in $\epsilon$, the back reaction of the AW on the acoustic wave occurs at $\mathcal {O}(\epsilon ^2)$. As assurance for the concerned reader, however, we demonstrate here that by going to at least third order in the compressive dynamical equations, energy conservation can be re-established. Additionally, this result aids in confirming that the frequency matching condition $k_{\rm A} \sim \sqrt {\beta }k_{\rm C}$ does not violate the assumptions used to obtain the ordered equations (2.6) and (2.7). To achieve this task, we begin by writing down the total energy of the background plus perturbations, which we normalize using $\rho _0 v_{\textrm {th},{\rm i}}^2 = 2p_0$ to make the orders of each contribution more apparent. Following this, we evaluate the time derivative of the total energy using the evolution equations (2.1), with the proper adjustments made for our assumptions about the problem geometry. Once this has been done, we examine the orders of the remaining terms to demonstrate that energy conservation is re-established by including all terms of order $\epsilon ^3$ or higher.

The total energy normalized to $\rho _0 v_{\textrm {th},{\rm i}}^2$ is given by

(A1)\begin{equation} \frac{E}{\rho_0 v_{{\rm th},{\rm i}}^2} = \int \mathrm{d}x \left\{ \frac{1}{\beta}\frac{\rho}{\rho_0} \left( \frac{u_\perp}{v_{\rm A}} \right)^2 +\frac{1}{\beta}\left(1+\frac{\delta B_\perp^2}{B_0^2} \right) + \frac{\rho}{\rho_0} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}}\right)^2 + \frac{p_\perp}{p_0}+ \frac{1}{2}\frac{p_\parallel}{p_0} \right\}. \end{equation}

Already, it is clear from (A1) that, according to the ordering (2.4), the Alfvénic contributions to the energy (terms involving $u_\perp$ and $\delta B_\perp$) are at most of order $\epsilon ^3$, while the compressive contributions (the last three terms) are up to linear in $\epsilon$. Taking the time derivative of (A1), the rate of change of the total energy is given by

(A2)\begin{align} \frac{1}{\rho_0 v_{{\rm th},{\rm i}}^2}\frac{\mathrm{d} E}{\mathrm{d}t}&= \int \mathrm{d}x \left\{ \frac{1}{\beta} \frac{\partial }{\partial t} \left(\frac{\delta \rho}{\rho_0}\right) \left(\frac{u_\perp}{v_{\rm A}} \right)^2 + \frac{1}{\beta} \left( \frac{\rho}{\rho_0} \right) \frac{\partial }{\partial t} \left( \frac{u_\perp}{v_{\rm A}} \right)^2 + \frac{1}{\beta} \frac{\partial }{\partial t} \left(\frac{\delta B_\perp}{B_0} \right)^2 \right.\nonumber\\ &\quad \left. + \frac{\partial }{\partial t} \left(\frac{\delta \rho}{\rho_0}\right) \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2 + \left( \frac{\rho}{\rho_0} \right) \frac{\partial }{\partial t} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2 + \frac{\partial }{\partial t}\frac{\delta p_\perp}{p_0} +\frac{1}{2}\frac{\partial }{\partial t}\frac{\delta p_\parallel}{p_0} \right\}. \end{align}

We examine these terms in turn with the aim of establishing the lowest order at which they appear.

First, we focus on those terms relating directly to the energy carried by the Alfvénic fluctuations, i.e. the top line of (A2). Denote the characteristic Alfvénic and compressive wavenumbers by $k_{\rm A}$ and $k_{\rm C}$, respectively. The first term is of order $k_{\rm C} c_{\rm s} \epsilon ^4$ (due to the continuity equation's compressive nature in $\partial _t$), while the second and third terms are of order $k_{\rm A} v_{\rm A} \epsilon ^3$. For all circumstances studied in this paper then (including frequency-matched fluctuations), the leading order of the second and third terms will dominate and be the main mechanism by which the Alfvénic fluctuations extract energy from the compressive ones. Therefore, neglecting the first term and inserting (2.5) for $\partial _t u_\perp$ and $\partial _t \delta B_\perp$, the energy rate of change simplifies to

(A3)\begin{align} \frac{1}{\rho_0 v_{{\rm th},{\rm i}}^2}\frac{\mathrm{d} E}{\mathrm{d}t} & \approx \int \mathrm{d}x \left\{ u_\perp \frac{\partial }{\partial x}\left(\frac{\delta B_\perp}{B_0} \frac{\varDelta p}{p_0} \right) + \frac{\partial }{\partial t} \left(\frac{\delta \rho}{\rho_0}\right) \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2 \right.\nonumber\\ &\quad \left.\vphantom{\left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2}+ \left( \frac{2\rho}{\rho_0} \right) \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \frac{\partial }{\partial t} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right) + \frac{\partial }{\partial t}\frac{\delta p_\perp}{p_0} + \frac{1}{2}\frac{\partial }{\partial t}\frac{\delta p_\parallel}{p_0} \right\}. \end{align}

For $\omega _{\rm A} \approx \omega _{\rm C}$, the remaining Alfvénic term on the first line of (A3) is of the exact same order as the kinetic energy terms on the bottom line, while, for $k_{\rm A} \approx k_{\rm C}$, it is one half-order in $\epsilon$ smaller. To satisfy energy conservation, we must then include all compressive terms that are up to, but not including, those of order $\epsilon ^4$.

Before embarking on the compressive terms, however, we note that the heat fluxes need not be included in (2.1e) and (2.1d) for the proof of energy conservation between Alfvén and acoustic waves. Their effect is to diffuse wave energy into the background, and not to facilitate communication between the Alfvénic and compressive fluctuations. We therefore proceed by evolving the pressure perturbations according to the double adiabats (2.13), which can be combined in the following manner:

(A4)\begin{align} \frac{\partial p_\perp}{\partial t} + \frac{1}{2} \frac{\partial p_\parallel}{\partial t} &= \frac{1}{2}\frac{\Delta p}{B^2}\frac{\partial B^2}{\partial t} + \frac{3p_\parallel}{2\rho}\frac{\partial \rho}{\partial t} - \frac{u_\parallel}{2}\frac{\partial p_\parallel}{\partial x} + \frac{3u_\parallel}{2} \left( \frac{p_\parallel}{\rho}\right) \frac{\partial \rho}{\partial x}\nonumber\\ &\quad + \frac{p_\perp}{\rho} \frac{\partial \rho}{\partial t} - u_\parallel\rho\frac{\partial }{\partial x}\left(\frac{p_\perp}{\rho}\right) + u_\parallel \left(\frac{p_\perp}{B}\right) \frac{\partial B}{\partial x} - u_\parallel\left(\frac{p_\parallel}{B}\right) \frac{\partial B}{\partial x} . \end{align}

Noting that $\delta B \propto \delta B_\perp ^2$, which makes $B^{-1}\partial _xB$ second order in $\epsilon$, the final two terms cancel up to and including $\mathcal {O}(\epsilon ^3)$. We also use the fact that (A4) will be integrated over space to swap the remaining $x$-derivative on the bottom line from $p_\perp /\rho$ to $u_\parallel \rho$, picking up a sign change. Using the continuity equation to cancel the first two terms on the second line, no more of the second line remains. Normalization and integration of the first term on the right-hand side of (A4) then gives

(A5)\begin{equation} \int \mathrm{d}x\, \frac{1}{2} \left(\frac{\Delta p}{p_0}\right) \frac{\delta B_\perp}{B_0} \frac{\partial }{\partial t} \left(\frac{\delta B_\perp}{B_0}\right) ={-} \int \mathrm{d}x \,\frac{u_\perp}{2}\frac{\partial }{\partial x}\left(\frac{\delta B_\perp}{B_0} \frac{\Delta p}{p_0} \right), \end{equation}

where (2.5a) has been used to substitute for $\partial _t \delta B_\perp$. This term will cancel the first term on the right-hand side of (A3), leaving us with the following expression for the energy rate of change:

(A6)\begin{align} \frac{1}{\rho_0 v_{{\rm th},{\rm i}}^2}\frac{\mathrm{d} E}{\mathrm{d}t} & \approx \int \mathrm{d}x \left\{\vphantom{\left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2} \frac{3p_\parallel}{2p_0}\left(\frac{1}{\rho} \frac{\partial \rho}{\partial t}\right) - \frac{u_\parallel}{2p_0}\frac{\partial p_\parallel}{\partial x} + \frac{3u_\parallel}{2\rho} \left( \frac{p_\parallel}{p_0}\right) \frac{\partial \rho}{\partial x} \right.\nonumber\\ &\quad \left. + \frac{\partial }{\partial t} \left(\frac{\delta \rho}{\rho_0}\right) \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2 + \left( \frac{2\rho}{\rho_0} \right) \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \frac{\partial }{\partial t} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right) \right\} + \mathcal{O}(\epsilon^4). \end{align}

At this point, it is may become obvious that, with all of the remaining terms being compressive in nature, they must cancel because the nonlinear interaction of a wave with itself generally conserves energy in the absence of dissipative effects like heat fluxes or collisions. Regardless, to solidify the argument, we will continue and demonstrate that there are no other Alfvénic feedback terms that appear at orders greater than $\epsilon ^4$. Using the continuity equation to simplify the top line of (A6) yields

(A7)\begin{equation} \int \mathrm{d}x \left\{ - \frac{3p_\parallel}{2p_0}\left[\frac{1}{\rho} \frac{\partial }{\partial x} (\rho u_\parallel)\right] - \frac{u_\parallel}{2p_0}\frac{\partial p_\parallel}{\partial x} + \frac{3u_\parallel}{2\rho} \left( \frac{p_\parallel}{p_0}\right) \frac{\partial \rho}{\partial x} \right\} = \int \mathrm{d}x \,\frac{u_\parallel}{p_0}\frac{\partial p_\parallel}{\partial x}, \end{equation}

which, after once again employing continuity, reduces (A6) to

(A8)\begin{equation} \frac{1}{\rho_0 v_{{\rm th},{\rm i}}^2}\frac{\mathrm{d} E}{\mathrm{d}t} \approx \int \mathrm{d}x \left\{\frac{u_\parallel}{p_0}\frac{\partial p_\parallel}{\partial x} + \frac{2\rho}{\rho_0} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right)^2 \frac{\partial u_\parallel}{\partial x} + \left(\frac{2\rho}{\rho_0} \right) \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \frac{\partial }{\partial t} \left( \frac{u_\parallel}{v_{{\rm th},{\rm i}}} \right) \right\} + \mathcal{O}(\epsilon^4). \end{equation}

The fully nonlinear parallel momentum in this case is given by

(A9)\begin{equation} \rho \frac{\partial u_\parallel}{\partial t} ={-} \rho u_\parallel \frac{\partial u_\parallel}{\partial x} - \frac{\partial p_\parallel}{\partial x} - \frac{\partial }{\partial x} \left(\frac{\delta B_\perp^2}{B^2}\Delta p \right) - \delta B_\perp \frac{\partial \delta B_\perp}{\partial x} . \end{equation}

Multiplying (A9) by $u_\parallel /\rho _0 v_{\textrm {th},{\rm i}}^2 = u_\parallel /2p_0$, the last two terms on its right-hand side become of order $\mathcal {O}(\epsilon ^4)$, while the first two cancel the remaining terms in (A8). As a result, we have shown that, up to and including $\mathcal {O}(\epsilon ^3)$ terms, total energy is conserved by the interaction of Alfvén and ion-acoustic waves according to the ordering (2.4). This finding tells us that, even when the frequencies of the two modes are matched and $k_{\rm A} \sim \sqrt {\beta }k_{\rm C}$, the leading order for the dynamical equations describing each wave remains unchanged. Naturally, no terms neglected in deriving (2.5) are enhanced with respect to the leading order by letting $k_{\rm A}$ increase by $\epsilon ^{-1/2}$. At the same time, Alfvénic feedback effects on the compressive fluctuations were of order $\epsilon ^3$, meaning the leading order compressive equations (2.7) remain the same until $\omega _{\rm A} \sim \beta ^{2}\omega _{\rm C}$. Nonlinear steepening will remain far more important in the frequency-matched regime of strong interaction, thus for (small) acoustic fluctuations of amplitude ${\sim }\beta ^{-1}$, the assumption of linearity is quite robust.

Appendix B. The $\varDelta$ profile of a Landau-fluid AW packet

This appendix provides a derivation of the pressure anisotropy generated by an isolated, parallel-propagating, large-amplitude ($\delta B_\perp /B_0 \sim \beta ^{-1/2}$) AW packet, specifically for the diffusive heat fluxes of a Landau-fluid-type model. As discussed in § 2.3, we predict that AWs of this type will steepen to form shocks in the profile of their $\delta B_\perp$ and $u_\perp$ perturbations. These shocks will propagate at a speed near $v_{\rm A}$, which is much slower than the thermal speed of the plasma, allowing the change in pressure anisotropy generated by $\delta B_\perp$ to diffuse upstream ahead of the shock. Here we derive the functional form of $\varDelta (x)$ as it propagates with the shock.

The first step is to introduce another ordering that describes this large-amplitude AW near the interruption limit and which is distinct from (2.4). The need for a new ordering originates from the fact that in the AW–AW interaction, the primary AW cannot be derived from the ordering (2.4) as (2.4) relies on linear perturbations to produce the pressure anisotropy. As stated in § 2.3, the primary AW induces pressure anisotropy nonlinearly. Therefore, to produce $\varDelta \sim \beta ^{-1}$, it must have $\delta B_\perp /B_0$ larger than that of the secondary packet, which obeys the Alfvénic components of (2.4). Our new ordering must then capture the fact that AWs perturb the pressure and other compressive fields nonlinearly, while being consistent with the high-$\beta$ and $\beta \varDelta \sim 1$ assumptions of (2.4) (such that the resultant $\varDelta (x)$ may be used in (2.6)):

(B1)\begin{equation} \frac{\delta B_\perp^2}{B_0^2} \sim \frac{u_\perp^2}{v_{\rm A}^2} \sim \frac{\delta \rho}{\rho_0} \sim \frac{u_\parallel}{v_{\rm A}} \sim \frac{1}{\beta} \sim \varDelta \sim \sigma^2 \sim \epsilon. \end{equation}

Here, $\sigma$ is a new expansion parameter satisfying $\sigma \gg \epsilon$. The estimates of $\partial _t$ for compressive and Alfvénic fields are the same as those used with (2.4) (${\sim }k c_{\rm s}$ and ${\sim }kv_{\rm A}$, respectively). Note that as an ordering, this is unaffected by the Alfvénic dynamics following $v_{{\rm A},\textrm {eff}}$ rather than $v_{\rm A}$, as the pressure anisotropy is strictly positive and thus will never approach the firehose limit where $v_{{\rm A},\textrm {eff}} \ll v_{\rm A}$.

The next step is to use (B1) to obtain an estimate for the size of the parallel pressure perturbation $\delta p_\parallel$, and show that it is much smaller than the perpendicular pressure perturbation $\delta p_\perp$. We apply (B1) to the parallel momentum equation evaluated downstream of the shock (where the gradient in $\delta B_\perp$ is non-existent), finding that the leading-order equation is simply linear:

(B2)\begin{equation} \rho_0 \frac{\partial u_\parallel}{\partial t} \approx{-}\frac{\partial \delta p_\parallel}{\partial x}. \end{equation}

This approximate equation implies that $\delta p_\parallel \sim \sigma ^2 \rho _0 v_{\rm A}c_{\rm s}$, or $\delta p_\parallel /p_0 \sim \sigma ^3$. Comparatively, no such restraint is placed on $\delta p_\perp$ by the perpendicular momentum equation, so the pressure anisotropy must be dominated by $\delta p_\perp$. The equation for $\delta p_\perp$ is in this case more easily formulated in terms of the perpendicular temperature $\delta T_\perp$, which according to (B1) and (2.1d) evaluated downstream from the shock satisfies

(B3)\begin{equation} \frac{\partial \delta T_\perp}{\partial t} \approx \kappa \frac{\partial ^2 \delta T_\perp}{\partial x^2}, \end{equation}

where $\kappa$ is a diffusion coefficient (equal to $\rho _0v_\textrm {th}/\sqrt {{\rm \pi} }|k_\||$ in the ‘$3+1$’ Landau-fluid model; Snyder et al. Reference Snyder, Hammett and Dorland1997). Given that $\kappa$ operates on thermal time scales, the diffusion of $\delta T_\perp$, and thus $\Delta p$, downstream is effectively instantaneous as the AW packet propagates. In the frame of the shock then, $\delta T_\perp$ attains a steady-state profile, meaning that it can be represented as $\delta T_\perp (x-v_{{\rm A},\max }t)$ (where $v_{{\rm A},\max }$ is a constant equal to the peak value of $v_{{\rm A},\textrm {eff}}$ evaluated at the shock front). Accordingly, the relation (B3) becomes

(B4)\begin{equation} \frac{\partial^2 \delta T_\perp}{\partial x^2} + \frac{v_{{\rm A},\max}}{\kappa} \frac{\partial \delta T_\perp}{\partial x} \approx 0 \quad \Longrightarrow \quad \delta T_\perp{\approx} \delta T_{{\perp},\max} \exp \left[\frac{v_{{\rm A},\max}}{\kappa}(v_{{\rm A},\max}t-x)\right]. \end{equation}

This gives the functional form of $\delta T_\perp$ leading up to the shock front (where $\delta T_{\perp, \max }$ is determined), and motivates our choice of exponentially decaying pressure anisotropy as a source for (2.6) used in § 2.3. To test the accuracy of this estimate of the decay length of $\delta T_\perp$, a series of Landau-fluid simulations were performed with varying choices of $\beta$ and for $|k_\||$ in (2.3), shown in figure 7. The set-up of the wave packet in these runs is the same as that of the primary packet described in § 3.2. The decay length of $\delta T_\perp$ is found in these simulations by waiting until the wave packet steepens to form a shock, then fitting an exponentially decaying function to the region directly in front of the peak of the magnetic-field perturbation. Compared with the theoretical estimate using the ‘$3+1$’ heat fluxes, strong agreement is found.

Figure 7. Comparison of the analytically predicted and simulated decay lengths of the pressure anisotropy precursor of a steepened AW packet, for $\boldsymbol {q}_{\perp /\|}$ given by the ‘$3+1$’ heat fluxes (2.3). The calculated decay length $\lambda _{{\rm T},\perp }$ is normalized by the dominant wavelength of the AW packet $\lambda _{{\rm A}}$. On the left, the decay length is varied with respect to the Landau wavenumber $|k_\||$, while on the right, it is varied with respect to $\beta$.

The actual magnitude of $\delta T_{\perp, \max }$ is determined by acknowledging that $\partial _x \delta B_\perp ^2$ is large in the region of the shock, and including its associated term at leading order in the equation left of the arrow in (B4) provides

(B5)\begin{equation} \frac{\partial^2 \delta T_\perp}{\partial x^2} + \frac{v_{{\rm A},\max}}{\kappa} \frac{\partial \delta T_\perp}{\partial x} \approx \frac{v_{{\rm A},\max} T_0}{2\kappa B_0^2}\frac{\partial \delta B_\perp^2}{\partial x}. \end{equation}

Integrating this equation from the shock front to the distant downstream region where $\partial _x \delta T_\perp \approx 0$, we obtain the simple result that $\delta T_{\perp, \max }/T_0 \approx (1/2)\delta B_{\perp, \max }^2/B_0^2$. Knowing that $\delta p_\parallel$ may be neglected, the same steps can be applied to (2.1e) to obtain a diffusion equation for $\delta \rho$, which is found to decay on a length scale of $(3/2)\kappa /v_{{\rm A},\max }$ instead of $\kappa /v_{{\rm A},\max }$. Similarly, $\delta \rho _{\max }$ is determined to be $\delta \rho _{\max }\approx (1/3)\delta B_{\perp, \max }^2/B_0^2$. Combined, these yield the final result that $\varDelta _{\max } \approx \delta p_{\perp, \max }/p_0 \approx (5/6)\delta B_{\perp, \max }^2/B_0^2$, and $v_{{\rm A},\max } = v_{\rm A}\sqrt {1+\beta \varDelta _{\max }/2}$.

Footnotes

1 $\beta \doteq 8{\rm \pi} p/B^2$ is the ratio of the thermal pressure of the plasma particles, $p$, and the energy density stored in the magnetic field, $B^2/8{\rm \pi}$, or equivalently twice the ratio of the squares of the sound speed $c_{{\rm s}}$ and the Alfvén speed $v_{{\rm A}}$, viz. $\beta \doteq 2c_{{\rm s}}^2/v_{{\rm A}}^2$. In this paper, ‘high-$\beta$’ means $\beta \gg 1$.

2 We are primarily concerned with effects on the inertial range of Alfvénic turbulence where the fluctuations are spatially anisotropic with respect to the background magnetic field with $k_\parallel /k_\perp \ll 1$ (e.g. Goldreich & Sridhar Reference Goldreich and Sridhar1995). Within intervals of the cascade not exhibiting this predominantly perpendicular polarization (e.g. near the forcing scale(s)), strong interactions between slow and Alfvén waves can occur (e.g. Howes & Nielson Reference Howes and Nielson2013).

3 It has recently been found that certain conditions lead to the dominance of the resonant oblique firehose instability, which has a threshold of $\beta _\| \varDelta \approx -1.4$ rather than ${\approx }-2$ (Hellinger & Matsumoto Reference Hellinger and Matsumoto2000; Bott et al., in preparation). At this more restrictive threshold, the magnetic tension is not nullified by the pressure anisotropy and so full interruption does not occur. Within the purview of this work, we do not rely on the realizability of interruption, nor do we address the role of microinstabilities in detail.

4 A non-decimal expression is available given that the ‘$3+1$’ heat fluxes yield a cubic dispersion relation. However, the analytic expression in this case is more complicated than it is useful.

5 For the same reason, the dynamical equations to leading order for each wave are unaffected by frequency matching. Although an increase in the gradients of Alfvénic quantities enhances their ability to affect the acoustic mode, the fact that those Alfvénic gradient terms are multiple orders of $\epsilon$ smaller than the linear compressive terms means that increasing $k_{\rm A}$ by a factor of $\sqrt {\beta }$ or even $\beta$ will not render the solution (2.11) inaccurate.

6 Although ignored for illustrative purposes here, it is never justified to neglect the heat fluxes entirely in the evolution of high-$\beta$ collisionless plasmas. Owing to their thermal nature, the contribution of heat fluxes in the pressure evolution equations (2.1d) and (2.1e) is typically on the order of ${\sim }k_\parallel c_{\rm s} \delta p_{\perp /\parallel }$. In high-$\beta$ plasmas, this time scale ($k_\parallel c_{\rm s}$) is comparable to the rate-of-change of $p_{\perp /\parallel }$ generated by acoustic fluctuations ($\omega _{\rm C}$) and much larger than that of Alfvénic fluctuations ($k_\parallel v_{\rm A}$).

7 Note that we only address shear AWs within this work. In the case of an infinite wave train of torsional AWs, no pressure anisotropy is produced, although they may still be affected by an externally supplied pressure anisotropy. In the case of a torsional AW packet, however, the front of the packet would still produce $\varDelta > 0$ while the tail would generate $\varDelta < 0$.

8 Squire et al. (Reference Squire, Quataert and Schekochihin2016) also found that as AW amplitudes approach the ‘interruption limit’, they are reshaped into square waves. Such a profile minimizes the variation in $|B|^2$, thereby minimizing the generation of $\varDelta$. Although this effect has little impact on the interaction of the waves, it is observed in our simulations of large-amplitude AW packets.

9 We have made no mention of energy conservation in the AW–AW interaction, even though the energy of the secondary packet increases. This is analogous to the IAW–AW interaction, given the $\sigma \sim \sqrt {\epsilon }$ ordering used in Appendix B. The energy gain of the secondary packet is first order to itself, but higher order to the larger amplitude primary packet.

10 In a periodic box, any finite-width wave packet that increases the local pressure anisotropy will, subject to heat fluxes, increase the background pressure anisotropy as well (and quite rapidly at high $\beta$). To simulate a wave packet that is much smaller than the extent of the background plasma then, only outflow boundary conditions can appropriately maintain the pressure anisotropy as a local perturbation, while removing the portion of $\Delta p$ that diffuses out away from the packet.

References

Alt, A. & Kunz, M.W. 2019 Onset of magnetic reconnection in a collisionless, high-$\beta$ plasma. J. Plasma Phys. 85 (1), 175850102.CrossRefGoogle Scholar
Arzamasskiy, L., Kunz, M.W., Squire, J., Quataert, E. & Schekochihin, A.A. 2023 Kinetic turbulence in collisionless high-$\beta$ plasmas. Phys. Rev. X 13 (2), 021014.Google Scholar
Bacchini, F., Arzamasskiy, L., Zhdankin, V., Werner, G.R., Begelman, M.C. & Uzdensky, D.A. 2022 Fully kinetic shearing-box simulations of magnetorotational turbulence in 2D and 3D. I. Pair plasmas. Astrophys. J. 938 (1), 86.CrossRefGoogle Scholar
Bott, A.F.A., Arzamasskiy, L., Kunz, M.W., Quataert, E. & Squire, J. 2021 Adaptive critical balance and firehose instability in an expanding, turbulent, collisionless plasma. Astrophys. J. Lett. 922 (2), L35.CrossRefGoogle Scholar
Cassak, P.A., Baylor, R.N., Fermo, R.L., Beidler, M.T., Shay, M.A., Swisdak, M., Drake, J.F. & Karimabadi, H. 2015 Fast magnetic reconnection due to anisotropic electron pressure. Phys. Plasmas 22 (2), 020705.CrossRefGoogle Scholar
Chew, G.F., Goldberger, M.L. & Low, F.E. 1956 The Boltzmann equation and the one-fluid hydromagnetic equations in the absence of particle collisions. Proc. R. Soc. Lond. A 236, 112.Google Scholar
Cho, J. & Lazarian, A. 2003 Compressible magnetohydrodynamic turbulence: mode coupling, scaling relations, anisotropy, viscosity-damped regime and astrophysical implications. Mon. Not. R. Astron. Soc. 345 (12), 325.CrossRefGoogle Scholar
Egedal, J., Gurram, H., Greess, S., Daughton, W. & , A. 2023 The force balance of electrons during kinetic anti-parallel magnetic reconnection. Phys. Plasmas 30 (6), 062106.CrossRefGoogle Scholar
Elsässer, W.M. 1950 The hydromagnetic equations. Phys. Rev. 79 (1), 183.CrossRefGoogle Scholar
Galtier, S., Nazarenko, S.V., Newell, A.C. & Pouquet, A. 2000 A weak turbulence theory for incompressible magnetohydrodynamics. J. Plasma Phys. 63 (5), 447.CrossRefGoogle Scholar
Goldreich, P. & Sridhar, S. 1995 Toward a theory of interstellar turbulence. 2: strong Alfvénic turbulence. Astrophys. J. 438, 763.CrossRefGoogle Scholar
Grete, P., O'Shea, B.W., Beckwith, K., Schmidt, W. & Christlieb, A. 2017 Energy transfer in compressible magnetohydrodynamic turbulence. Phys. Plasmas 24 (9), 092311.CrossRefGoogle Scholar
Hellinger, P. & Matsumoto, H. 2000 New kinetic instability: oblique Alfvén fire hose. J. Geophys. Res. 105, 10519.CrossRefGoogle Scholar
Howes, G.G. & Nielson, K.D. 2013 Alfvén wave collisions, the fundamental building block of plasma turbulence. I. Asymptotic solution. Phys. Plasmas 20 (7), 072302.CrossRefGoogle Scholar
Iroshnikov, P.S. 1964 Turbulence of a conducting fluid in a strong magnetic field. J. Sov. Astron. 7, 566.Google Scholar
Kawazura, Y., Barnes, M. & Schekochihin, A.A. 2019 Thermal disequilibration of ions and electrons by collisionless plasma turbulence. Proc. Natl Acad. Sci. USA 116 (3), 771.CrossRefGoogle ScholarPubMed
Komarov, S., Schekochihin, A.A., Churazov, E. & Spitkovsky, A. 2018 Self-inhibiting thermal conduction in a high-$\beta$, whistler-unstable plasma. J. Plasma Phys. 84, 905840305.CrossRefGoogle Scholar
Kraichnan, R.H. 1968 Small-scale structure of a scalar field convected by turbulence. Phys. Fluids 11, 945.CrossRefGoogle Scholar
Kunz, M.W., Abel, I.G., Klein, K.G. & Schekochihin, A.A. 2018 Astrophysical gyrokinetics: turbulence in pressure-anisotropic plasmas at ion scales and beyond. J. Plasma Phys. 84 (2), 715840201.CrossRefGoogle Scholar
Kunz, M.W., Squire, J., Balbus, S.A., Bale, S.D., Chen, C.H.K., Churazov, E., Cowley, S.C., Forest, C.B., Gammie, C.F., Quataert, E., et al. 2019 [Plasma 2020 Decadal] The material properties of weakly collisional, high-$\beta$ plasmas. arXiv:1903.04080.Google Scholar
Kunz, M.W., Squire, J., Schekochihin, A.A. & Quataert, E. 2020 Self-sustaining sound in collisionless, high-${\beta }$ plasma. J. Plasma Phys. 86, 905860603.CrossRefGoogle Scholar
Kunz, M.W., Stone, J.M. & Quataert, E. 2016 Magnetorotational turbulence and dynamo in a collisionless plasma. Phys. Rev. Lett. 117 (23), 235101.CrossRefGoogle Scholar
Lithwick, Y. & Goldreich, P. 2001 Compressible magnetohydrodynamic turbulence in interstellar plasmas. Astrophys. J. 562 (1), 279.CrossRefGoogle Scholar
Majeski, S., Kunz, M.W. & Squire, J. 2023 Microphysically modified magnetosonic modes in collisionless, high-${\beta }$ plasmas. J. Plasma Phys. 89, 905890303.CrossRefGoogle Scholar
Montgomery, D. & Turner, L. 1981 Anisotropic magnetohydrodynamic turbulence in a strong external magnetic field. Phys. Fluids 24, 825.CrossRefGoogle Scholar
Nazarenko, S.V. & Schekochihin, A.A. 2011 Critical balance in magnetohydrodynamic, rotating and stratified turbulence: towards a universal scaling conjecture. J. Fluid Mech. 677, 134.CrossRefGoogle Scholar
Quataert, E. 2003 Radiatively inefficient accretion flow models of Sgr A$^\ast$. Astron. Nachr. 324, 435443.CrossRefGoogle Scholar
Rincon, F., Schekochihin, A.A. & Cowley, S.C. 2015 Non-linear mirror instability. Mon. Not. R. Astron. Soc. 447, L45.CrossRefGoogle Scholar
Roberg-Clark, G.T., Drake, J.F., Swisdak, M. & Reynolds, C.S. 2018 Wave generation and heat flux suppression in astrophysical plasma systems. Astrophys. J. 867, 154.CrossRefGoogle Scholar
Rudakov, L.I. & Sagdeev, R.Z. 1958 Plasma Physics and the Problem of Controlled Thermonuclear Reactions, vol. 3. Acad. Sci. Press, p. 153.Google Scholar
Sagdeev, R.Z. & Galeev, A.A. 1969 Nonlinear Plasma Theory. W. J. Benjamin, Inc.Google Scholar
Sandoval, A., Riquelme, M., Spitkovsky, A. & Bacchini, F. 2023 Particle-in-cell simulations of the magnetorotational instability in stratified shearing boxes. arXiv:2308.12348.Google Scholar
Santos-Lima, R., de Gouveia Dal Pino, E.M., Kowal, G., Falceta-Gonçalves, D., Lazarian, A. & Nakwacki, M.S. 2014 Magnetic field amplification and evolution in turbulent collisionless magnetohydrodynamics: an application to the intracluster medium. Astrophys. J. 781 (2), 84.CrossRefGoogle Scholar
Schekochihin, A.A., Cowley, S.C., Dorland, W., Hammett, G.W., Howes, G.G., Quataert, E. & Tatsuno, T. 2009 Astrophysical gyrokinetics: kinetic and fluid turbulent cascades in magnetized weakly collisional plasmas. Astrophys. J. Suppl. 182, 310.CrossRefGoogle Scholar
Sharma, P., Hammett, G.W., Quataert, E. & Stone, J.M. 2006 Shearing box simulations of the MRI in a collisionless plasma. Astrophys. J. 637, 952.CrossRefGoogle Scholar
Sharma, P., Quataert, E., Hammett, G.W. & Stone, J.M. 2007 Electron heating in hot accretion flows. Astrophys. J. 667, 714.CrossRefGoogle Scholar
Sironi, L., Comisso, L. & Golant, R. 2023 Generation of near-equipartition magnetic fields in turbulent collisionless plasmas. Phys. Rev. Lett. 131 (5), 055201.CrossRefGoogle ScholarPubMed
Snyder, P.B., Hammett, G.W. & Dorland, W. 1997 Landau fluid models of collisionless magnetohydrodynamics. Phys. Plasmas 4, 3974.CrossRefGoogle Scholar
Southwood, D.J. & Kivelson, M.G. 1993 Mirror instability. I – physical mechanism of linear instability. J. Geophys. Res. 98, 9181.CrossRefGoogle Scholar
Squire, J., Kunz, M.W., Arzamasskiy, L., Johnston, Z., Quataert, E. & Schekochihin, A.A. 2023 Pressure anisotropy and viscous heating in weakly collisional plasma turbulence. J. Plasma Phys. 89, 905890417.CrossRefGoogle Scholar
Squire, J., Kunz, M.W., Quataert, E. & Schekochihin, A.A. 2017 a Kinetic simulations of the interruption of large-amplitude shear-Alfvén waves in a high-$\beta$ plasma. Phys. Rev. Lett. 119 (15), 155101.CrossRefGoogle Scholar
Squire, J., Quataert, E. & Schekochihin, A.A. 2016 A stringent limit on the amplitude of Alfvénic perturbations in high-beta low-collisionality plasmas. Astrophys. J. 830, L25.CrossRefGoogle Scholar
Squire, J., Schekochihin, A.A. & Quataert, E. 2017 b Amplitude limits and nonlinear damping of shear-Alfvén waves in high-beta low-collisionality plasmas. New J. Phys. 19, 055005.CrossRefGoogle Scholar
St-Onge, D.A. & Kunz, M.W. 2018 Fluctuation dynamo in a collisionless, weakly magnetized plasma. Astrophys. J. Lett. 863, L25.CrossRefGoogle Scholar
St-Onge, D.A., Kunz, M.W., Squire, J. & Schekochihin, A.A. 2020 Fluctuation dynamo in a weakly collisional plasma. J. Plasma Phys. 86 (5), 905860503.CrossRefGoogle Scholar
Stone, J.M., Gardiner, T.A., Teuben, P., Hawley, J.F. & Simon, J.B. 2008 Athena: a new code for astrophysical MHD. Astrophys. J. 178, 137.CrossRefGoogle Scholar
Winarto, H.W. & Kunz, M.W. 2022 Triggering tearing in a forming current sheet with the mirror instability. J. Plasma Phys. 88, 905880210.CrossRefGoogle Scholar
Zhou, M., Zhdankin, V., Kunz, M.W., Loureiro, N.F. & Uzdensky, D.A. 2024 Magnetogenesis in a collisionless plasma: from Weibel instability to turbulent dynamo. Astrophys. J. 960 (1), 12.CrossRefGoogle Scholar
Figure 0

Figure 1. (a) Characteristic curves for an AW subjected to an exponentially decaying pressure anisotropy (2.18) with characteristic decay length $l_\varDelta =0.1L$ and phase speed $v_{{\rm A},\textrm {eff}}=\sqrt {2}v_{\rm A}$. Characteristics originating near the left of the domain begin to converge by the time they reach the right side of the domain, indicating $\varDelta$-induced AW steepening. (b) Energy gain by the secondary AW packet after one Alfvén-crossing time for different $l_\varDelta$, demonstrating that a steeper $\varDelta$ profile leads to more rapid gain in energy by the secondary packet.

Figure 1

Figure 2. Landau-fluid CGL-MHD simulation of the interaction between the pressure anisotropy driven by an IAW packet (black) and an AW packet (orange) with matched frequencies. The IAW packet collisionlessly damps rapidly from the initial conditions (top frame), before nonlinearly deforming the short-wavelength AW packet (middle). After the IAW has decayed and propagated rightwards out of the domain, a backward-propagating AW packet has developed, with no apparent change to the wavenumber of the parent AW.

Figure 2

Figure 3. Energy contained within backward-propagating AWs after one Alfvén time of interaction between a forward-propagating monochromatic AW and a standing monochromatic IAW, determined numerically as a function of the ratio of the Alfvén and acoustic wavenumbers using Landau-fluid simulations (red pluses). An approximate analytical solution for the energy (3.1) is shown as the solid line. Overall good agreement between theory and simulation is found, demonstrating strong interaction when the wave frequencies are approximately matched.

Figure 3

Figure 4. (a) Energy spectrum of forward- and backward-propagating Alfvénic fluctuations during interaction with a standing IAW. (b) Total change in wave energy (orange) and imbalance (black), normalized to the initial Alfvénic fluctuation energy $\mathcal {E}_0$, versus time. In panel (a), the IAW–AW interaction primarily generates new AW fluctuations at $k_{\rm A}\pm k_{\rm C}$, exhibited by the steepness of the spectra outside of the interval $k \in [k_{\rm A}-k_{\rm C}, k_{\rm A}+k_{\rm C}]$. This explains the accuracy of (3.1), as fluctuations at $k_{\rm A} \pm 2k_{\rm C}$ are too weak to affect the solution dramatically. In panel (b), the change in energy imbalance is several orders of magnitude smaller than the increase in total energy of the Alfvénic fluctuations, demonstrating that the IAW–AW interaction decreases imbalance with respect to overall AW energy.

Figure 4

Figure 5. (a) Magnetic field and pressure anisotropy profiles of the steepened AW packet, and (b) relationship between the perpendicular pressure and the magnetic perturbation amplitude at the front of the shock. The steepened wave packet is led by a shock in the magnetic-field profile, with a smoothed pressure anisotropy profile and $\varDelta$ precursor modifying the local Alfvén speed. The dip in $\varDelta$ behind the wave packet results from the initial conditions of the AW not including $\varDelta$, hence the magnetic-field strength decreases in this region as the packet propagates away. As this packet propagates, the approximate conservation of the $T_\perp /B$ adiabat sets the amplitude of the decaying anisotropy, and thus $v_{{\rm A},\textrm {eff}}$, at the shock front.

Figure 5

Figure 6. Time slices of a Landau-fluid simulation of the the AW–AW packet interaction. The pressure anisotropy of the primary packet is evolved alongside the magnetic perturbation of the secondary packet. The packets are initially separated (top), yet over time, the enhanced effective speed allow the primary packet to approach the secondary from behind (middle). By the end of the simulation, the effective wavenumber of the secondary packet has more than doubled from the steepening induced by the primary packet's $\varDelta$ precursor.

Figure 6

Figure 7. Comparison of the analytically predicted and simulated decay lengths of the pressure anisotropy precursor of a steepened AW packet, for $\boldsymbol {q}_{\perp /\|}$ given by the ‘$3+1$’ heat fluxes (2.3). The calculated decay length $\lambda _{{\rm T},\perp }$ is normalized by the dominant wavelength of the AW packet $\lambda _{{\rm A}}$. On the left, the decay length is varied with respect to the Landau wavenumber $|k_\||$, while on the right, it is varied with respect to $\beta$.