Hostname: page-component-75d7c8f48-x9v92 Total loading time: 0 Render date: 2026-03-14T09:22:18.454Z Has data issue: false hasContentIssue false

A classification of mode-1 internal solitary waves in a three-layer fluid

Published online by Cambridge University Press:  10 March 2026

Ricardo Barros*
Affiliation:
Department of Mathematical Sciences, Loughborough University , Loughborough LE11 3TU, UK
Alex George Doak
Affiliation:
Department of Mathematical Sciences, University of Bath, Bath, BA2 7AY, UK
Wooyoung Choi
Affiliation:
Department of Mathematical Sciences, New Jersey Institute of Technology, Newark, NJ 07102-1982, USA
Paul A. Milewski
Affiliation:
Department of Mathematical Sciences, University of Bath, Bath, BA2 7AY, UK Department of Mathematics, The Pennsylvania State University, State College, PA 16802, USA
*
Corresponding author: Ricardo Barros, r.barros@lboro.ac.uk

Abstract

We explore the bifurcation structure of mode-1 solitary waves in a three-layer fluid confined between two rigid boundaries. A recent study Lamb (2023 J. Fluid Mech., vol. 962, A17) proposed a method to predict the coexistence of solitary waves with opposite polarity in a continuously stratified fluid with a double pycnocline by examining the conjugate states for the Euler equations. We extend this line of inquiry to a piecewise-constant three-layer stratification, taking advantage of the fact that the conjugate states for the Euler equations are exactly preserved by the strongly nonlinear model that we will refer to as the three-layer Miyata-Maltseva-Choi-Camassa (MMCC3) equations. In this reduced setting, solitary waves are governed by a Hamiltonian system with two degrees of freedom, whose critical points are used to explain the bifurcation structure. Through this analysis, we also discover families of solutions that have not been previously reported for a three-fluid system. Using the shared conjugate state structure between the MMCC3 model and the full Euler equations, we propose criteria for distinguishing the full range of solution behaviours. This alignment between the reduced and full models provides strong evidence that partitioning the parameter space into regions associated with distinct solution types is valid within both theories. This classification is further substantiated by numerical solutions to both models, which show excellent agreement.

Information

Type
JFM Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press

1. Introduction

Internal solitary waves (ISWs) in density stratified fluids have been the focus of extensive theoretical, numerical and experimental research. Their common occurrence in coastal oceans has prompted numerous field studies aimed at understanding their influence on mass transport and mixing within the global ocean (Moum et al. Reference Moum, Farmer, Smyth, Armi and Vagle2003; Shroyer, Moum & Nash Reference Shroyer, Moum and Nash2010). Oceanic ISWs can be modelled as solutions to the Euler equations for a continuous stratification that, in steady form, are known as the Dubreil–Jacotin–Long (DJL) equation (Dubreil-Jacotin Reference Dubreil-Jacotin1932; Long Reference Long1953). Although there are infinitely many modes of propagation in the ocean, as predicted by linear theory, most previous literature, both observational and theoretical, concerns waves of the first baroclinic mode (mode-1), which have the simplest vertical structure (see Helfrich & Melville Reference Helfrich and Melville2006 and references therein). Mode-1 solutions to the DJL equation have been computed for different stratifications by using various methods (Tung, Chan & Kubota Reference Tung, Chan and Kubota1982; Turkington, Eydeland & Wang Reference Turkington, Eydeland and Wang1991; Stastna & Lamb Reference Stastna and Lamb2002; Stastna Reference Stastna2022). These studies suggest that a wide range of solution behaviours can be expected. In particular, while certain stratification profiles lead to wave broadening as amplitude increases, others result in wave narrowing with growing amplitude (Lamb & Wan Reference Lamb and Wan1998). Additional phenomena, such as the coexistence of solutions with opposite polarity, were recently examined by Lamb (Reference Lamb2023) in the context of a double-pycnocline stratification. This configuration features two hyperbolic tangent functions that represent a smooth density profile with two sharp transitions. Lamb found that wave broadening occurred consistently across all physical parameters considered.

The double-pycnocline stratification profile can be idealised as a three-layer fluid system as depicted in figure 1, in which the densities of the fluids in each layer are constant. This simplified configuration facilitates analysis and has been extensively studied using weakly nonlinear theory, including the Korteweg-de Vries (KdV) equation (Yang et al. Reference Yang, Fang, Tang and Ramp2010) and the Gardner equation (Kurkina et al. Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015). Weakly nonlinear theory has revealed a variety of solution behaviours: mode-1 ISWs may appear as waves of elevation, waves of depression or as two distinct branches with opposite polarity. More importantly, it enables the delineation of regions in the parameter space where different types of solutions are present. However, concerns have been raised regarding the validity of these models when compared with the full Euler theory (Lamb & Yan Reference Lamb and Yan1996).

As an effort to understand the strongly nonlinear effects on ISWs in this system, Barros, Choi & Milewski (Reference Barros, Choi and Milewski2020) and Doak, Barros & Milewski (Reference Doak, Barros and Milewski2022) have adopted the strongly nonlinear theory, based on a long-wave model that extends the well-known (two-layer) Miyata–Maltseva–Choi–Camassa (MMCC) model to three layers. While previous studies have commonly referred to this model as the Miyata–Choi–Camassa (MCC) model (Miyata Reference Miyata1988; Choi & Camassa Reference Choi and Camassa1999), recent recognition of the contributions by Maltseva (Reference Maltseva1989) warrants this rectification. Accordingly, the model used in this work will be referred to as the three-layer MMCC (MMCC3) model. Although our previous investigations focused primarily on mode-2 solutions, we use it here to provide a classification of mode-1 solutions.

Despite being simpler than Euler, the MMCC3 model captures exactly the conjugate states in the sense of Benjamin (Reference Benjamin1971), which limit the amplitude of mode-1 solutions. This is a promising sign for the accuracy of the MMCC3 model in describing waves well beyond the weakly nonlinear regime, as established for the two-layer system by Camassa et al. (Reference Camassa, Choi, Michallet, Rusas and Sveen2006). Furthermore, as pointed out by Doak et al. (Reference Doak, Barros and Milewski2022), the conjugate states are related to the critical points of the dynamical system of the MMCC3 theory, a Hamiltonian system with two degrees of freedom. This offers a dramatic reduction in complexity compared with the Euler equations, which we exploit to shine light on the bifurcation structure of mode-1 solutions. We are able to provide a complete classification of mode-1 solitary waves for the MMCC3 system, covering all of the parameter space under the Boussinesq approximation. The analysis of the conjugate states is paramount to delimit regions in the parameter space where different types of solutions can be found. In particular, the criterion for the coexistence of solutions of opposite polarity established by Lamb (Reference Lamb2023) is recovered. Other criteria are proposed for other types of solutions, including cases in which multiple branches with the same polarity are observed. The corresponding profiles feature an extra bump on the broadened section, and a comparable solution was reported for a double-pycnocline stratification by Dunphy, Subich & Stastna (Reference Dunphy, Subich and Stastna2011), albeit with a background shear current in the far field. Waves of this type, exhibiting multiscale structure, were coined pyramidal solitons by Pelinovsky, Tatiana & Tarmo (Reference Pelinovsky, Tatiana and Tarmo2021), and have also been predicted, using asymptotic approximations to the DJL equation, in continuously stratified fluids characterised by small density gradients by Makarenko, Maltseva & Kazakov (Reference Makarenko, Maltseva and Kazakov2009) and Derzho (Reference Derzho2017).

The paper is structured as follows. In § 2 we introduce the models examined in the paper. In § 3 we provide an overview of the main findings. In § 4 we examine the properties of the critical points of the MMCC3 system, with particular emphasis on their collision. In § 5 we use these findings to predict different bifurcation structures and provide a complete description of the solution space, under Boussinesq approximation. In § 6 we present and compare numerical solutions from both the MMCC3 and full Euler models. Concluding remarks are given in § 7.

2. Preliminaries

We consider long waves in a physical system composed of three fluid layers, bounded above and below by a rigid flat surface, as depicted in figure 1. The stratification is assumed gravitationally stable, i.e. the constant densities $\rho _i$ for $i=1,2,3$ (from top to bottom) satisfy $\rho _1\lt \rho _2 \lt \rho _3$ . The location of the upper and lower interfaces are defined by $z=\eta _2 \equiv \zeta _1 (x,t)$ and $z=\eta _3 \equiv -H_2 + \zeta _2 (x,t)$ , respectively. The two rigid walls are located at $z=\eta _1 \equiv H_1$ and $z=\eta _4 \equiv -(H_2+H_3)$ . The thickness of the $i$ th layer $h_i$ is then given by $h_i=\eta _i-\eta _{i+1}$ . More precisely,

(2.1) \begin{equation} h_1= H_1 - \zeta _1, \quad h_2 = \zeta _1 - \zeta _2 + H_2, \quad h_3 = H_3 + \zeta _2, \end{equation}

where $H_i$ denotes the undisturbed thickness of the $i$ th layer. The total depth of the water column will be denoted by $H$ and defined by $H=H_1+H_2+H_3$ .

Figure 1. A three-fluid system.

Assuming the fluid in each layer is inviscid and incompressible, the velocity vector $\boldsymbol{u_i}(x,z,t)=(u_i,w_i)$ in each layer is governed by the Euler equations

(2.2) \begin{align} \rho _i \frac {D_i\boldsymbol{u_i}}{{D}t} &= -\boldsymbol{\nabla }\!p_i - \rho _i g \boldsymbol{e_z}, \qquad\qquad i=1,2,3, \end{align}
(2.3) \begin{align} \boldsymbol{\nabla }\boldsymbol{\cdot }\boldsymbol{u_i} &=0, \qquad\qquad\qquad\qquad\qquad \!\!\! i=1,2,3, \end{align}

where $p_i$ is the pressure in layer $i$ , the operator $D_i/{D}t(=\partial _t + \boldsymbol{u_i}\boldsymbol{\cdot }\boldsymbol{\nabla })$ is the material derivative with respect to the velocity field in layer $i$ , $g$ is the acceleration due to gravity and $\boldsymbol{e_z}$ is the unit vector in the $z$ direction. On the top and bottom walls, we have no normal flow, that is,

(2.4) \begin{align} \boldsymbol{u_3}\boldsymbol{\cdot }\boldsymbol{e_z} &=0 \qquad\qquad\text{at }\quad z=\eta _4, \end{align}
(2.5) \begin{align} \boldsymbol{u_1}\boldsymbol{\cdot }\boldsymbol{e_z}&=0 \qquad\qquad\text{at }\quad z=\eta _1 . \end{align}

On the a priori unknown interfaces, we have both kinematic boundary conditions, given by

(2.6) \begin{align} \frac {D_{i-1}(\eta _i-z)}{{D}t} &=0 \qquad\qquad \text{at } \quad z=\eta _i, \quad i=2,3, \end{align}
(2.7) \begin{align} \frac {D_{i}(\eta _i-z)}{{D}t} &=0 \qquad\qquad \text{at }\quad z=\eta _i, \quad i=2,3, \end{align}

and continuity of pressure, i.e.

(2.8) \begin{align} p_i =p_{i-1} \qquad\qquad \text{at }\quad z=\eta _i, \quad i=2,3. \end{align}

We impose that $\boldsymbol{u_i}$ and $\eta _i$ decay to zero as $|x|\rightarrow \infty$ . The fully nonlinear system (2.2)–(2.8) is referred to as the Euler system throughout this paper.

To examine large amplitude long waves in this system, we also adopt the strongly nonlinear multilayer model proposed by Choi (Reference Choi2000), which, starting from the Euler system above, under weak horizontal vorticity and long-wave ( $\epsilon = H_i/\lambda \ll 1$ ) assumptions, is formulated in terms of the thicknesses of each layer, denoted by $h_i$ , and depth-averaged horizontal velocities $\overline {u}_i$ . Namely, consisting of the mass conservation laws for each layer,

(2.9) \begin{equation} h_{i,t} + (h_i \,\overline {u}_i)_x =0, \end{equation}

and the momentum equations,

(2.10) \begin{equation} {\overline u}_{i,t} +{\overline u}_i{\overline u}_{i,x}+g \eta _{i,x}+{P_{i,x}\over \rho _i} ={1\over h_i} \left ({\frac {1}{3}} h_i^3 a_i+\frac {1}{2} h_i^2 b_i \right )_x +\left (\frac {1}{2} h_i a_i + b_i \right )\,{\eta _{i+1,x}}, \end{equation}

where $a_i(x,t)=-(D_i^2 h_i)/h_i$ and $b_i(x,t) = - D_i^2 \eta _{i+1}$ with $D_i=\partial /\partial t+ {\overline u_i}\, \partial /\partial x$ . The pressure at the location $z=\eta _i (x,t)$ denoted by $P_i$ satisfies the recursion formula

(2.11) \begin{equation} P_{i+1} = P_i + \rho _i \left (g h_i -\frac {1}{2} a_i h_i^2 - b_i h_i \right )\!. \end{equation}

The system of equations composed of (2.9) and (2.10) for $i=1,2,3$ and (2.11) for $i=1,2$ , along with a geometric constraint given by $h_1+h_2+h_3=H_1+H_2+H_3=H$ will be referred to as the MMCC3 model.

According to linear theory, there exist two propagation modes in each direction, known as mode-1 (faster) and mode-2 (slower) according to the magnitude of the wave speed. Linear long-wave speeds $c_0^\pm$ ( $0\lt c_0^-\lt c_0^+$ ) are obtained as the roots of the equation

(2.12) \begin{align} \big [ (\rho _1 H_2 + \rho _2 H_1)c^2 - g (\rho _2-\rho _1) H_1 H_2 \big ] \big [ (\rho _2 H_3 + \rho _3 H_2) c^2 &- g (\rho _3-\rho _2) H_2 H_3 \big ] \nonumber \\ &\quad -\rho _2^2 H_1 H_3 \,c^4 =0, \end{align}

and, for linear waves, the ratio between the two interface displacements can be found as

(2.13) \begin{equation} \zeta _2/\zeta _1 = \gamma ^\pm , \end{equation}

where $\pm$ refers to the propagation mode considered and $\gamma ^\pm$ is obtained from either equality:

(2.14) \begin{equation} \gamma ^\pm = 1+\frac {\rho _1 H_2}{\rho _2 H_1} - g \left ( \frac {\rho _2-\rho _1}{\rho _2}\right ) \frac {H_2}{{c_0^\pm }^2} = \left [1+\frac {\rho _3 H_2}{\rho _2 H_3} - g \left ( \frac {\rho _3-\rho _2}{\rho _2}\right ) \frac {H_2}{{c_0^\pm }^2} \right ]^{-1}\!. \end{equation}

An alternate form of the equation for the linear long-wave speed (2.12) can be written as

(2.15) \begin{equation} \left [ 1+\frac {\rho _1 H_2}{\rho _2 H_1} - g \left ( \frac {\rho _2-\rho _1}{\rho _2}\right ) \frac {H_2}{c^2} \right ] \left [ 1+\frac {\rho _3 H_2}{\rho _2 H_3} - g \left ( \frac {\rho _3-\rho _2}{\rho _2}\right ) \frac {H_2}{c^2} \right ] =1. \end{equation}

In addition, we have $\gamma ^+\gt 0$ and $\gamma ^-\lt 0$ (see, e.g. Barros et al. Reference Barros, Choi and Milewski2020).

It will be assumed, without loss of generality, that the stratification is given as

(2.16) \begin{equation} \rho _1 = \rho _0 (1-\varDelta _1), \quad \rho _2=\rho _0, \quad \rho _3 = \rho _0 (1+ \varDelta _2), \end{equation}

with $0\lt \varDelta _1\lt 1$ , $\varDelta _2\gt 0$ . Furthermore, we introduce the parameter $\delta$ as the ratio between the two density increments:

(2.17) \begin{equation} \delta =\varDelta _1/\varDelta _2. \end{equation}

2.1. Boussinesq approximation and upside-down symmetry

The Boussinesq approximation is commonly adopted in the study of stratified fluids when the density variation is small. For the physical system under consideration, under the Boussinesq approximation, both $\varDelta _1 = (\rho _2-\rho _1)/\rho _2$ and $\varDelta _2 = (\rho _3-\rho _2)/\rho _2$ are assumed small ( $\ll 1$ ) and of the same order of magnitude, and can be neglected except where they appear in terms multiplied by $g$ . Under this approximation, the linear long-wave speeds $c_0$ are roots of

(2.18) \begin{equation} \big [ (H_1+H_2 )c^2 - g_1^\prime H_1 H_2 \big ] \big [ ( H_2 + H_3) c^2 - g_2^\prime H_2 H_3 \big ] - H_1 H_3 \,c^4 =0, \end{equation}

and $\gamma$ is defined accordingly as

(2.19) \begin{equation} \gamma ^\pm = 1+ \frac {H_2}{H_1} - \frac {g_1^\prime H_2}{{c_0^\pm }^2} = \left [ 1+\frac {H_2}{H_3} - \frac {g_2^\prime H_2}{{c_0^\pm }^2} \right ]^{-1}\!. \end{equation}

Here, $g_1^\prime = g \varDelta _1$ and $g_2^\prime = g \varDelta _2$ denote reduced gravities (the notation $g^\prime$ will be used unambiguously in the case when $\varDelta _1=\varDelta _2$ ).

We note that, under the Boussinesq approximation, both the Euler and MMCC3 systems are invariant under the so-called ‘upside-down’ symmetry (see Guan et al. Reference Guan, Doak, Milewski and Vanden-Broeck2024 and Barros et al. Reference Barros, Choi and Milewski2020, respectively). For every steady solution pair $(\zeta _1,\zeta _2)$ of a given configuration with densities $\rho _0 (1-\varDelta _1)$ , $\rho _0$ , $\rho _0 (1+\varDelta _2)$ and undisturbed thickness of layers $H_1$ , $H_2$ , $H_3$ (from top to bottom), $(-\zeta _2, -\zeta _1)$ is a solution of the modified physical configuration with densities $\rho _0 (1-\varDelta _2)$ , $\rho _0$ , $\rho _0 (1+\varDelta _1)$ , where the undisturbed thickness of layers are $H_3$ , $H_2$ , $H_1$ (from top to bottom).

In the remaining part of this section, we discuss the characterisation of solitary waves provided by weakly nonlinear theory, with particular emphasis on mode-1, also known as the first baroclinic mode.

2.2. Korteweg-de Vries theory

If a weakly nonlinear theory is adopted, by considering unidirectional waves in the limit of $\alpha = a/H_i=O(H_i^2/\lambda ^2)=O(\epsilon ^2)$ ( $\epsilon \ll 1$ ), with $a$ and $\lambda$ typical values of amplitude and wavelength, a KdV equation can be derived for the upper interface $z=\zeta _1 (x,t)$ , i.e.

(2.20) \begin{equation} \zeta _{1,t} + c_0 \,\zeta _{1,x} + c_1 \,\zeta _1 \,\zeta _{1,x} + c_2 \,\zeta _{1,xxx}=0, \end{equation}

with coefficients $c_1$ and $c_2$ given by

(2.21) \begin{align} c_1 & = \frac {3}{2} c_0 \frac {\dfrac {\rho _3}{H_3^2} \gamma ^3 + \dfrac {\rho _2}{H_2^2} (1-\gamma )^3 -\dfrac {\rho _1}{H_1^2}}{\dfrac {\rho _3}{H_3} \gamma ^2 + \dfrac {\rho _2}{H_2} (1-\gamma )^2+\dfrac {\rho _1}{H_1}}, \\[-12pt]\nonumber \end{align}
(2.22) \begin{align} c_2 & = \frac {1}{6} c_0 \frac {\rho _3 H_3 \gamma ^2 + \rho _2 H_2 (1+\gamma +\gamma ^2)+\rho _1 H_1}{\dfrac {\rho _3}{H_3} \gamma ^2 + \dfrac {\rho _2}{H_2} (1-\gamma )^2+\dfrac {\rho _1}{H_1}}, \end{align}

where $\gamma$ is defined in (2.14). Within the KdV theory, the propagation modes are fully decoupled, and solutions for mode-1 (mode-2) are obtained by replacing $c_0$ by $c_0^+$ ( $c_0^-$ ) in (2.20) and (2.14). For each mode, ISWs exist for (2.20) as long as their wave speed $c$ is greater than $c_0$ , and their polarity depends strictly on the sign of the quadratic nonlinearity coefficient $c_1$ ( $c_2\gt 0$ regardless of the physical parameters considered): the ISW is of elevation (depression) for $\zeta _1$ if $c_1\gt 0$ ( ${\lt } 0$ ). On the other hand, according to the KdV theory, no ISWs exist in the case when the coefficient $c_1$ vanishes, i.e. when

(2.23) \begin{equation} \frac {\rho _3}{H_3^2} \gamma ^3 + \frac {\rho _2}{H_2^2} (1-\gamma )^3 - \frac {\rho _1}{H_1^2} =0, \end{equation}

often referred to as the criticality condition. Typically, and regardless of the propagation mode considered, for given densities $\rho _i$ ( $i=1,2,3$ ) and a fixed ratio $H_1/H_3$ , criticality is attained at one single point in the parameter space on the $(H_1/H, H_2/H)$ -plane. An exception arises when the stratification satisfies $\rho _2/\rho _1= \rho _3/\rho _2$ . In this unique case, if mode-1 is considered, criticality is maintained along the entire line $H_2/H = - ( 1+(\rho _3/\rho _1)^{1/2} ) H_1/H + 1$ (see Appendix B in Barros et al. Reference Barros, Choi and Milewski2020).

We remark that, alternatively, a KdV equation could have been written for the lower interface displacement $\zeta _2(x,t)$ :

(2.24) \begin{equation} \zeta _{2,t} + c_0 \,\zeta _{2,x} + \tilde {c}_1 \,\zeta _2 \,\zeta _{2,x} + c_2 \,\zeta _{2,xxx}=0. \end{equation}

Here $\tilde {c}_1 = c_1/\gamma$ and the equation can be easily recovered from (2.20) by setting $\zeta _1 = \zeta _2/\gamma$ .

Figure 2. (a) Shows the locus in the parameter space where the criticality condition (2.23) is satisfied, under the Boussinesq approximation with $\varDelta _1=\varDelta _2$ . The criticality curve (in blue), along which no ISWs exist according to the KdV theory, splits up into two components on the $(H_1/H, H_2/H)$ -plane, one of which is the line $H_2/H=-2 H_1/H + 1$ , corresponding to a fixed ratio $H_1/H_3=1$ . Also marked on this line is the point with coordinates $(9/26,4/13)$ at which a transition from non-existence to the existence of solitary waves to the mKdV equation (2.25) occurs. This is made clearer in (b) where the mKdV theory is adopted by relaxing the constraint $H_1=H_3$ , for which the locus (in thick black) for the coexistence of ISWs of opposite polarity is revealed.

To obtain the expression of the KdV coefficients under the Boussinesq approximation, as well as the corresponding criticality condition, it suffices to set $\rho _1 = \rho _2 = \rho _3 = \rho _0$ , along with $c_0$ defined implicitly by (2.18), and consider $\gamma$ defined by (2.19) instead of (2.14). By doing so, we recover the KdV equation presented by Yang et al. (Reference Yang, Fang, Tang and Ramp2010), based on the work by Benney (Reference Benney1966) (see also Benjamin Reference Benjamin1966; Grimshaw Reference Grimshaw1981).

It is well known that when the Boussinesq approximation is adopted, with equal density increments ( $\varDelta _1=\varDelta _2$ ) and the same undisturbed depths for the top and bottom layers ( $H_1=H_3$ ), then mode-1 solitary waves cannot exist according to KdV theory. The reason being that, for these parameters, we have $\gamma ^+=1$ , which trivially satisfies the criticality condition. On the other hand, if the modified KdV (mKdV) equation

(2.25) \begin{equation} \zeta _{1,t} + c_0^+ \,\zeta _{1,x} + \frac {1}{12} c_0^+ H_1(2H_1 + 3H_2) \,\zeta _{1,xxx} - \frac {1}{8 H_1^3} c_0^+ (8H_1 -9H_2) (\zeta _1^3)_x =0 \end{equation}

is used for the first baroclinic mode (with $c_0^+= \sqrt {g^\prime H_1}$ ), different predictions can be made (see Talipova et al. Reference Talipova, Pelinovsky, Lamb, Grimshaw and Holloway1999). When $H_1\gt (9/8) \,H_2$ (or equivalently $H_2/H\lt 4/13$ along the line $H_2/H = -2 H_1/H+1$ , as in figure 2 $a$ ), no mode-1 ISWs exist according to the mKdV theory, in agreement with the KdV theory. In contrast, when $H_1 \lt (9/8) \,H_2$ , two branches of solutions with opposite polarity can be predicted by the mKdV theory (see Grimshaw, Pelinovsky & Talipova Reference Grimshaw, Pelinovsky and Talipova1997). We note that for the same density stratification, i.e. $\delta =1$ , a mKdV equation can be written by relaxing the constraint $H_1=H_3$ . This equation is defined solely along the criticality curve. Hence, the locus for the coexistence of ISWs of opposite polarity, according to the mKdV equation, is a subset of the criticality curve. In figure 2 $(b)$ this subset is depicted in black. In the remaining subset (in blue) of the criticality curve, no ISWs can exist according to the theory.

We remark that, under the Boussinesq approximation, the KdV theory carries over the upside-down symmetry of the Euler equations. This results from (2.20) and (2.24) and the conditions

(2.26) \begin{align} c_0(H_3,H_2,H_1,1/\delta ) &= c_0(H_1,H_2,H_3,\delta ), \\[-12pt]\nonumber \end{align}
(2.27) \begin{align} c_1(H_3,H_2,H_1,1/\delta ) &= -\tilde {c}_1(H_1,H_2,H_3,\delta ), \\[-12pt]\nonumber \end{align}
(2.28) \begin{align} c_2(H_3,H_2,H_1,1/\delta ) &= c_2(H_1,H_2,H_3,\delta ), \end{align}

which are all satisfied throughout the entire parameter space.

2.3. Gardner theory

Following Choi & Camassa (Reference Choi and Camassa1999), we can extend the KdV equation (2.20) to various orders in an attempt to better describe larger amplitude waves. Namely, the Gardner equation:

(2.29) \begin{align} \zeta _{1,t} +c_0 \,\zeta _{1,x} + c_1 \zeta _1 \zeta _{1,x} + c_2 \,\zeta _{1,xxx} + c_3 \big(\zeta _1^3\big)_x = 0. \end{align}

This model is a truncation that mixes terms of different orders and, contrary to KdV, the terms may not balance as $\epsilon \rightarrow 0$ .

As expected, the coefficients $c_1$ , $c_2$ are the same as for the KdV equation, given by (2.21) and (2.22), respectively. The cubic nonlinearity coefficient $c_3$ is found (see Appendix A) as

(2.30) \begin{equation} c_3= \frac {T_1+T_2+T_3}{\dfrac {\rho _3}{H_3}\gamma ^2+\dfrac {\rho _2}{H_2}(1-\gamma )^2+\dfrac {\rho _1}{H_1}}, \end{equation}

with $T_1$ , $T_2$ , $T_3$ given by

(2.31) \begin{align} T_1 &=-\frac {1}{4} c_0 \Bigg [ 9 \frac {H_2}{\rho _2}\Bigg ( \frac {\rho _3}{H_3^2}\gamma ^2-\frac {\rho _2}{H_2^2}(1-\gamma )^2\Bigg )\Bigg (\frac {\rho _2}{H_2^2}(1-\gamma )^2-\frac {\rho _1}{H_1^2}\Bigg ) \nonumber \\ & \quad +\, 4\Bigg ( \frac {\rho _3}{H_3^3}\gamma ^4+\frac {\rho _2}{H_2^3}(1-\gamma )^4+\frac {\rho _1}{H_1^3}\Bigg )\Bigg ], \end{align}
(2.32) \begin{align} T_2 &=-\frac {1}{6} \frac {c_1}{\rho _2} \Bigg [ 10 \frac {\rho _2^2}{H_2^2}(1-\gamma )^3+\frac {3\rho _1}{H_1} \frac {\rho _3}{H_3} \left ( \frac {2 H_2}{H_1}-3\frac {H_2}{H_3}\gamma \right ) \,\gamma + 3\frac {\rho _2}{H_2}(1-\gamma )^2 \nonumber \\ & \quad \times\, \left ( 3 \frac {\rho _1}{H_1}-2\frac {\rho _3}{H_3}\gamma \right ) +\frac {\rho _1}{H_1}\frac {\rho _2}{H_1} (-1+6\gamma )+ \frac {\rho _2}{H_3} \frac {\rho _3}{H_3} \gamma ^2 \,(-9+4\gamma )\Bigg ], \end{align}
(2.33) \begin{align} T_3 &=-\frac {1}{6}\frac {c_1^2}{c_0} \Bigg [ -3\left ( \frac {\rho _3}{H_3}\gamma ^2+\frac {\rho _2}{H_2}(1-\gamma )^2+\frac {\rho _1}{H_1}\right ) \nonumber \\ & \quad +\, \frac {4 \gamma }{\rho _2 H_1 H_3} (\rho _1 \rho _2 H_3+\rho _1 \rho _3 H_2 +\rho _2 \rho _3 H_1)\Bigg ]. \end{align}

As before, to recover the expression of the Gardner coefficients under the Boussinesq approximation (see Kurkina et al. Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015), it suffices to set $\rho _1 = \rho _2 = \rho _3 = \rho _0$ , along with $c_0$ defined implicitly by (2.18), and consider $\gamma$ defined by (2.19). Here, the cubic nonlinearity coefficient $c_3$ can change sign depending on the physical parameters considered. Given that the coefficient $c_2$ of the linear dispersive term $\zeta _{1,xxx}$ is always positive, it is well known that when $c_3\gt 0$ waves of either polarity exist with no maximum amplitude. Minimum wave amplitudes exist for waves of elevation (depression) for $c_1\lt 0$ ( ${\gt } 0$ ). Specifically, these have a minimum amplitude of $-2c_1/(3 c_3)$ . On the other hand, when $c_3\lt 0$ , a single branch of solutions exists with the same polarity as that predicted by the KdV equation, with the important difference that such solutions now broaden with increasing wave speeds (or amplitude), until a front is reached (see, e.g. Grimshaw, Pelinovsky & Talipova Reference Grimshaw, Pelinovsky and Talipova1999). A schematic illustration of all solution behaviours is given in figure 3 $(b)$ following Grimshaw et al. (Reference Grimshaw, Pelinovsky and Talipova1999).

Figure 3. Behaviour of the quadratic and cubic nonlinearity coefficients ( $c_1$ , $c_3$ , respectively) of the Gardner equation (2.29) for the upper interface displacement $\zeta _1(x,t)$ , when restricted to mode-1 waves. The stratification is given by (2.16) with $\varDelta _1=\varDelta _2=0.1$ . It is shown how the signs for the nonlinearity coefficients may vary across the whole parameter space. The colour scheme shown in panel (b) and used in panel (a) is the same as that set by Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015), and allows us to easily distinguish the various solution behaviours. Shaded regions in (b) represent a minimum amplitude along a solution branch.

Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015) have examined in great detail how the quadratic and cubic nonlinearity coefficients can vary with the different physical parameters considered in a three-layer fluid, under the Boussinesq approximation. In particular, it was highlighted through extensive numerical tests that the coefficient $c_3$ could never change sign for mode-2 waves, and remained negative. As a consequence, Gardner theory predicts that the coexistence of ISWs of opposite polarity can only occur for mode-1 waves. As in Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015), when graphically presenting the parameter space for different solutions, we may fix the density increments $\Delta _i$ and show the result of varying the undisturbed depths $H_i$ . The domain of physically permissible parameters is then a triangle with vertices $(0,0)$ , $(0,1)$ and $(1,0)$ in the $(H_1/H,H_2/H)$ -space. We show in figure 3 how the sign for the quadratic and cubic nonlinearity coefficients changes with $\varDelta _1=\varDelta _2=0.1$ . If we consider $\varDelta _1=\delta \varDelta _2$ , with $\delta \neq 1$ and $\varDelta _2=0.1$ , and draw similar pictures, we observe that, despite the somewhat large density contrasts considered, the results obtained are in good agreement with those obtained by Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015), reproduced here in figure 4. Note that when the Boussinesq approximation is considered, sweeping the parameter space becomes easier, since it is defined only by the three parameters $\delta$ , $H_1/H$ and $H_2/H$ . We can infer from the figures that given fixed densities, the coexistence of solutions of opposite polarity can be predicted within a limited range of values of $H_2/H$ , or none at all, depending on the ratio between the undisturbed thicknesses of the top and bottom layer (see figure 6).

Figure 4. Behaviour of the quadratic and cubic nonlinearity coefficients ( $c_1$ , $c_3$ , respectively) of the Gardner equation (2.29) for the upper interface displacement $\zeta _1(x,t)$ , when restricted to mode-1 waves under the Boussinesq approximation. Different values of $\delta$ are considered, and the same colour scheme as for figure 3 is adopted.

Figure 5. Behaviour of the quadratic and cubic nonlinearity coefficients ( $\tilde {c}_1$ , $\tilde {c}_3$ , respectively) of the Gardner equation (2.34) for the lower interface displacement $\zeta _2(x,t)$ , when restricted to mode-1 waves under the Boussinesq approximation. Different values of $\delta$ are considered, and the colour scheme remains consistent with that of figure 3; however, the coordinate axes have been redefined.

We remark that, alternatively, a Gardner equation could have been written for the lower interface displacement $\zeta _2(x,t)$ :

(2.34) \begin{equation} \zeta _{2,t} + c_0 \,\zeta _{2,x} + \tilde {c}_1 \,\zeta _2 \,\zeta _{2,x} + c_2 \,\zeta _{2,xxx} + \tilde {c}_3 \big(\zeta _2^3 \big)_x=0. \end{equation}

As shown in Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015), under the Boussinesq approximation, the following relationship holds:

(2.35) \begin{equation} c_3(H_3,H_2,H_1,1/\delta ) = \tilde {c}_3(H_1,H_2,H_3,\delta ). \end{equation}

This shows that the Gardner theory does preserve the upside-down symmetry of the Euler equations. Consequently, figure 4 $(c)$ can be transformed into figure 5 $(a)$ using the mapping $(x,y)\mapsto (1-x-y,y)$ , with the caveat of a reversed colour scheme (green replacing blue and orange replacing red), as we would be describing the behaviour of $-\zeta _2(x,t)$ . What is intriguing is that figures 5 $(a)$ and 4 $(a)$ exhibit clear discrepancies, suggesting that the predicted solution behaviour may depend on which form of the Gardner equation, (2.29) or (2.34), is used; cf. Kurkina et al. (Reference Kurkina, Kurkin, Rouvinskaya and Soomere2015). Similar considerations apply to other panels in figure 4.

Figure 6. Vanishing of the quadratic (in blue) and cubic (in purple) nonlinearity coefficients ( $c_1$ , $c_3$ , respectively) of the Gardner equation (2.29) for $\zeta _1(x,t)$ , when restricted to mode-1 waves under Boussinesq approximation. The parameter $\delta$ is set to 1, as in figure 4( $b$ ). The grey lines from the top vertex to the base of the triangle correspond to different ratios $H_1/H_3$ . Note that a fixed ratio $H_1/H_3=m$ determines the line with the equation $H_2/H = -(1+m^{-1} ) H_1/H +1$ in the parameter space. From left to right we have: $H_1/H_3=0.8, \, \approx 2.5155, \,20$ . For $H_1/H_3\gt 2.5155$ , according to the Gardner theory for $\zeta _1$ no ISWs of opposite polarity can coexist. This will be shown to be in disagreement with the predictions by the MMCC3 and Euler theories (e.g. see figure 9 $a$ ).

Before moving to the next section, we make additional remarks regarding the non-uniqueness of the Gardner equations: (i) other physical variables could be used in the expansion (such as the mean layer horizontal velocity) leading to yet a different solution behaviour in parts of parameter space; (ii) despite this non-uniqueness, physical considerations may, in some cases, be invoked to guide the selection of an appropriate Gardner model for specific parameter regimes; (iii) the curves corresponding to criticality coincide in the corresponding panels of figures 4 and 5, reflecting the fact that predictions for the solution behaviour of the KdV equation are uniquely defined and do not depend on the choice of dependent variable; (iv) at criticality, the subset of the parameter space in which the cubic nonlinearity coefficient is non-negative is likewise uniquely determined, irrespective of the particular Gardner equation adopted; (v) finally, and most importantly, we note that even when different Gardner formulations agree, this agreement does not, as will be demonstrated in the following sections, guarantee an accurate description of the strongly nonlinear or Euler results.

Figure 7. Possible solution spaces for solitary waves with speeds $c\gt c_0^+$ . Here, all solution branches are limited in amplitude. For any set of parameters, the bifurcation of mode-1 waves for the Euler and MMCC3 systems is described by one of the panels A–F. Furthermore, for $H_2$ sufficiently large, one finds mode-2 solitary waves, with a bifurcation structure described in Doak et al. (Reference Doak, Barros and Milewski2022), represented here by panel $M2$ . When applicable, shaded regions represent a minimum amplitude along a solution branch. Panels A and B show two branches of solitary waves of opposite polarity, both evolving into tabletop solitary waves. Panel B has a finite amplitude bifurcation for the depression branch, while the elevation branch bifurcates from zero amplitude. The opposite is true for panel A. Panels C and D show a single solitary wave branch of depression and elevation, respectively, developing into a tabletop solitary wave. Panels E and F are new bifurcation structures. There exist two branches, both of depression (panel E, $c_1\lt 0$ ) or elevation (panel F, $c_1\gt 0$ ). One branch (shown in black) bifurcates from zero amplitude and limits to a tabletop solitary wave. A second branch (shown in blue) exists with speeds strictly greater than those on the first branch. The branch starts at the same limiting tabletop solitary wave as the first branch, but the corresponding profiles include an additional bump on the broadened section. As one proceeds along the branch via continuation, the broadened section gradually diminishes until the solution takes the form of a conventional solitary wave. As the amplitude continues to increase, the wave broadens again and approaches a different tabletop solitary wave with a larger limiting amplitude. These new branches are described in greater detail in § 6.

3. Highlights of the main results

Having presented the three-layer models of KdV, mKdV and Gardner in their generality, we outline here the key findings towards a comprehensive classification of mode-1 ISWs in a three-layer fluid with the Euler and MMCC3 theories. Figure 7 provides a schematic representation of the full range of solutions for this physical configuration. Symmetry about the wave crest is assumed in all our numerical investigations, and the qualitative description applies to all the solutions with speeds $c\gt c_0^+$ found within both the strongly and fully nonlinear theories. We show only the upper interface $\zeta _1$ , as for each solution the lower interface has a comparable shape, with the exception of panel $M2$ . For any set of parameters the solution space for mode-1 waves will behave like one of the panels AF (excluding the case when there are no mode-1 solitary waves). The panels $C$ and $D$ are the simplest, each featuring a single branch, which can be of elevation (panel $D$ ) or depression (panel $C$ ). Both branches emerge from zero amplitude at the linear long-wave speed $c = c_0^+$ and evolve into tabletop solitary waves as the speed increases, ultimately approaching a front. Such behaviour is observed for mode-1 solitary waves in a two-layer fluid (see Choi & Camassa Reference Choi and Camassa1999 and Evans et al. Reference Evans and Ford1996 for the strongly and fully nonlinear theories, respectively) and the polarity is the same as the KdV polarity, determined by the sign of the quadratic nonlinearity coefficient. In panels $A$ and $B$ we see two branches, both bifurcating from the linear long-wave speed. However, in panel $A$ the elevation branch bifurcates from finite amplitude (the smallest amplitude wave being shown by the shaded region and the solutions along this branch are shown in blue), while the depression branch bifurcates from zero amplitude. The situation is similar in panel $B$ , but with the roles of the elevation and depression branch reversed. Similarly to the Gardner theory, the branches of solutions bifurcating from zero amplitude are those with the KdV polarity.

The behaviour shown in panels AD share several characteristics with the mode-1 solutions of the Gardner equation, as illustrated in figure 3 $(b)$ . Although the sign for the quadratic nonlinearity coefficient $c_1$ remains crucial in distinguishing the different solution properties within the strongly and fully nonlinear theories, the sign for the cubic nonlinearity coefficient $c_3$ does not. Furthermore, all branches of solutions have tabletop solitary waves ending in fronts, in contrast with the Gardner theory, where multiple solitary branches – when present – do not have a limiting amplitude. We also note that the behaviour observed in panels AD was also reported in a continuous double-pycnocline stratification by Lamb (Reference Lamb2023).

Panels $E$ and $F$ exhibit novel solution space behaviour with no direct counterpart in Gardner theory. In these panels, two branches of the same polarity are present, both of depression in panel $E$ and both of elevation in panel $F$ . However, there is no true multiplicity of solutions: one branch emerges from zero amplitude and gradually approaches a limiting wavefront, while the other bifurcates at a finite amplitude with higher propagation speeds than the first branch. A more detailed discussion of these solution branches is provided in § 6.

Finally, panel $M2$ shows mode-2 solitary waves with speeds exceeding the linear long-wave speed of mode-1 $c_0^+$ . Such solutions were found to exist in the strongly nonlinear theory by Barros et al. (Reference Barros, Choi and Milewski2020), and were explored in more detail in both the strongly nonlinear and full Euler theories by Doak et al. (Reference Doak, Barros and Milewski2022). For sufficiently large $H_2$ , these solutions also exist, in conjunction with mode-1 solutions with behaviours from panels $A$ or $B$ .

The results in this paper primarily stem from a conjugate state analysis for the Euler equations. According to Benjamin (Reference Benjamin1966), two horizontally uniform flows are considered conjugate if all three basic physical conservation laws of mass, momentum and energy are satisfied. Conveniently, the conjugate states of the Euler equations remain fully preserved within the MMCC3 theory, for which the study of steady solutions is significantly simpler. Within this reduced framework, it will be seen how the critical point analysis can be used to explain different and new bifurcation structures in the solutions, which are then validated using the Euler equations. A glimpse of the results is given in figures 17 and 18. All technical details are left to subsequent §§ 4, 5 and 6.

3.1. Solution space and conjugate states

The MMCC3 model has recently been examined by Barros et al. (Reference Barros, Choi and Milewski2020) and Doak et al. (Reference Doak, Barros and Milewski2022). Notably, Barros et al. (Reference Barros, Choi and Milewski2020) have shown that ISWs are governed by a Hamiltonian system with two degrees of freedom, for which the potential $V=V(\zeta _1,\zeta _2)$ is a rational function (see (4.3)). Furthermore, Doak et al. (Reference Doak, Barros and Milewski2022) established that the solutions of

(3.1) \begin{equation} V=0, \quad \frac {\partial V}{\partial \zeta _1}=0, \quad \frac {\partial V}{\partial \zeta _2}=0 \end{equation}

correspond to uniform flows to the Euler equations, which are conjugate. This problem was first addressed by Lamb (Reference Lamb2000), who arrived to a set of three algebraic equations – (12a), (12b), (14) therein – equivalent to our system (3.1). To solve the system, Lamb (Reference Lamb2000) employed a geometric approach. However, a more insightful procedure involves plotting branches of conjugate state solutions of (3.1) across different parameter regimes (see, e.g. figure 5 in Doak et al. Reference Doak, Barros and Milewski2022), following the work of Dias & Il’ichev (Reference Dias and Il’ichev2001) and Barros (Reference Barros2016). This approach is particularly useful under the Boussinesq approximation, as it allows us to analyse how the number and speed of conjugate state solutions change with increasing values of $H_2/H$ for fixed $\delta$ and $H_1/H_3$ , as illustrated in figure 8. The black dashed lines represent the curves $c=c_0^\pm$ , corresponding to the linear long-wave speeds, and for small values of $H_2/H$ , a single branch of solutions with $c\gt c_0^+$ is present. For thicker intermediate layers, two or even three such branches can be perceived. The branches are colour coded based on the quadrant of the $(\zeta _1,\zeta _2)$ -plane to which the corresponding conjugate solutions belong. Orange and blue will typically correspond to solutions of mode-1, and red and green to solutions of mode-2. However, it can be observed from the figure that solutions sought in odd quadrants do exist with speeds less than $c_0^+$ , as well as solutions sought in even quadrants for speeds larger than $c_0^+$ (see the red branch for large values of $H_2/H$ ).

Figure 8. Solutions to the conjugate state (3.1), under the Boussinesq approximation. In each panel we plot the branches of conjugate flow speeds that exist for a fixed ratio $H_1/H_3$ and varying $H_2/H$ . The physical parameters here correspond to those in figure 6. Panels (a)(c) show results for increasing values of $H_1/H_3$ , respectively $H_1/H_3=0.8, \, \approx 2.5155, \,20$ . The colour scheme in panel $(d)$ allows us to identify the quadrant of the $(\zeta _1,\zeta _2)$ -plane to which the corresponding conjugate state belongs to. The black dashed curves are the linear long-wave speeds.

A method has recently been proposed by Lamb (Reference Lamb2023) to determine where in the parameter space ISWs of both polarities exist in a continuous double-pycnocline stratification. The same method can equally be applied to our piecewise-constant stratification and consists of identifying the physical parameters for which a blue and orange branch coexist with $c\gt c_0^+$ . According to the author, ‘This method is somewhat tedious as it requires the determination of where the conjugate flow speed is equal to the linear long-wave propagation speed’. Our analysis will confirm that the onset of the desired feature is indeed predicted by instances where the conjugate flow speed matches the linear long-wave speed $c=c_0^+$ . However, it will also become evident that not all such instances lead to the expected feature (see figure 10). Furthermore, it will be shown that the conjugate state analysis extends beyond this point, and previously unexplored aspects of the conjugate flow speed curves, which for brevity, we will refer to as the conjugate curves, will enable the discovery of new types of solution branches; cf. figure 7.

In figure 8 $(c)$ the tangency between the conjugate curve and the curve $c=c_0^+$ is what sets the coexistence of the two branches of opposite polarity (orange and blue) with $c\gt c_0^+$ . In contrast, in figure 8 $(a)$ , it is the instance when the orange branch intersects the curve $c=c_0^+$ that sets such occurrence. Although not particularly visible in the figure, this intersection precedes a tangency between the orange branch and the curve $c=c_0^+$ (at $H_2/H\approx 0.398$ ). By preserving the same value $\delta =1$ and varying the ratio $H_1/H_3$ , similar figures can be obtained. By doing so, we can cover the entire parameter space as in figure 9 $(a)$ and highlight the regions where conjugate curves of opposite polarities exist for $c\gt c_0^+$ , simply by observing the intersections (blue dots) and tangency points (blue solid line) between the conjugate curve and the curve $c=c_0^+$ . This naturally raises the question of how this relates to the coexistence of ISWs with opposite polarities. Our extensive numerical tests confirm that when conjugate states of opposite polarities coexist with $c\gt c_0^+$ , wave fronts can be exhibited for each conjugate state (in both the strongly and fully nonlinear theories), ultimately limiting the amplitude of solitary-wave solution branches. This further supports the validity of the criterion proposed by Lamb (Reference Lamb2023). An additional justification for its validity will be presented in the following sections within the framework of strongly nonlinear theory.

Figure 9. Region (shaded) of coexistence of ISWs of opposite polarity for different values of $\delta$ , according to the strongly and fully nonlinear theories, under the Boussinesq approximation. Instances where the conjugate flow speed matches the linear long-wave speed $c=c_0^+$ are represented in blue (dots for intersections and a solid line for tangency points). It can be observed that the set of tangency points coincides with the locus corresponding to the criticality condition (2.23) for the KdV equation. The purple curves correspond to the vanishing of the cubic nonlinearity coefficient for the Gardner equation for $\zeta _1$ , and are shown to illustrate the stark difference between Gardner and MMCC3 predictions. According to the Gardner theory for $\zeta _1$ , coexistence of ISWs exist only in the region enclosed by this curve (see, e.g. figure 4 b).

In figure 9 we overlay the locus (in purple) where the cubic nonlinearity of the Gardner equation vanishes, allowing for a comparison with the weakly nonlinear theory. According to the Gardner theory, ISWs of opposite polarity coexist within each closed region enclosed by a purple curve. For $\delta =1$ , some discrepancies can be observed in panel $(a)$ . However, these differences become even more pronounced as $\delta$ varies (see panel b).

Returning to the point made earlier on unexplored aspects of the conjugate curves allowing us to unveil new types of ISWs, we first highlight the intricate behaviour of these curves, as depicted in figure 10. In the figure, with $\delta =0.5$ , $H_1/H_3=0.1$ , in addition to the intersections and tangencies between the conjugate curves and the curve $c=c_0^+$ , we also track the self-intersections (grey squares) of the conjugate curves, and where they first emerge (black square), provided $c\gt c_0^+$ . By varying $H_1/H_3$ while keeping $\delta$ constant, the entire parameter space can be explored. All these markers can then be compiled, as in figure 11 $(a)$ , to delimit regions where distinct ISWs properties arise. This classification is further illustrated in figure 11 $(b)$ , following the colour scheme outlined in figure 7 (see also figure 17 with $\delta =1$ to see how the complete solution space compares with figure 9 $a$ ). How these markers separate the bifurcation spaces, and why, is explored in the following sections.

Figure 10. Solutions to the conjugate state (3.1), under Boussinesq approximation, for $\delta =0.5$ and $H_1/H_3=0.1$ , using the same colour scheme as figure 8. The figure also includes vertical dotted curves, which deliminate the various solution space behaviours given by panels A–F and $M2$ in figure 7. Panel $(b)$ shows a blow-up of the region marked by a box in panel $(a)$ . In panel $(b)$ we highlight the appearance of two additional conjugate state branches (a maximum and a slower saddle) with $c\gt c_0^+$ with a black square, the self-intersection of two mode-1 conjugate states (both maxima) with $c\gt c_0^+$ with a grey square and mode-1 conjugate states passing through $c=c_0^+$ with a blue circle.

Figure 11. All details of our conjugate state analysis encapsulated in one figure. In panel $(a)$ we identify in the parameter region for $\delta =0.5$ all relevant instances for the conjugate curves. The main markers are defined in figures 9 and 10 $(b)$ . In addition, the red line denotes the set of all intersections between a red conjugate curve and the linear long-wave speed $c=c_0^+$ (see figure 10 $a$ for large values of $H_2/H$ ). This information can be used to delimit regions where different properties for mode-1 ISWs can be found, as in panel $(b)$ . The colour scheme used here is outlined in figure 7. The purple lines represent the condition $c_3=0$ , included for comparison with predictions from Gardner theory for $\zeta _1$ .

4. Strongly nonlinear theory: a critical point analysis

The solitary-wave solutions for the MMCC3 model are governed by the following dynamical system (see Barros et al. Reference Barros, Choi and Milewski2020):

(4.1) \begin{align} & c^2 \left \{ \frac {1}{3} \left (\rho _1 \frac {H_1^2}{h_1} + \rho _2 \frac {H_2^2}{h_2} \right ) \zeta _1^{\prime \prime }\mkern -1.2mu + \frac {1}{6} \rho _2 \frac {H_2^2}{h_2} \, \zeta _2^{\prime \prime }\mkern -1.2mu + \frac {1}{6} \left ( \rho _1 \frac {H_1^2}{h_1^2} - \rho _2 \frac {H_2^2}{h_2^2} \right ) {\zeta _1^\prime }^2 + \frac {1}{3} \rho _2 \frac {H_2^2}{h_2^2} \,\zeta _1^\prime \zeta _2^\prime \right. \nonumber \\ &\quad \left. +\, \frac {1}{3} \rho _2 \frac {H_2^2}{h_2^2} \,{\zeta _2^\prime }^2 \right \} = \frac {1}{2}c^2 \left [ (\rho _2-\rho _1) + \rho _1 \frac {H_1^2}{h_1^2} - \rho _2 \frac {H_2^2}{h_2^2} \right ] - g(\rho _2-\rho _1) \zeta _1, \end{align}
(4.2) \begin{align} & c^2 \left \{ \frac {1}{6} \rho _2 \frac {H_2^2}{h_2} \, \zeta _1^{\prime \prime }\mkern -1.2mu + \frac {1}{3} \left ( \rho _2\frac {H_2^2}{h_2} +\rho _3 \frac {H_3^2}{h_3} \right ) \zeta _2^{\prime \prime }\mkern -1.2mu - \frac {1}{3} \rho _2 \frac {H_2^2}{h_2^2} \, {\zeta _1^\prime }^2 - \frac {1}{3} \rho _2 \frac {H_2^2}{h_2^2} \, \zeta _1^\prime \zeta _2^\prime \right. \nonumber \\ &\quad \left. +\, \frac {1}{6} \left ( \rho _2 \frac {H_2^2}{h_2^2} - \rho _3 \frac {H_3^2}{h_3^2} \right ) {\zeta _2^\prime }^2 \right \} = \frac {1}{2}c^2 \left [ (\rho _3-\rho _2) + \rho _2 \frac {H_2^2}{h_2^2} - \rho _3 \frac {H_3^2}{h_3^2} \right ] - g(\rho _3-\rho _2)\zeta _2 .\end{align}

Here $V(\zeta _1,\zeta _2)$ is defined by

(4.3) \begin{align} V(\zeta _1,\zeta _2) &= -\frac {1}{2} c^2 \left ( \rho _2-\rho _1 + \rho _1 \frac {H_1}{h_1} \right ) \zeta _1 + \frac {1}{2} (\rho _2-\rho _1)g\,\zeta _1^2 \nonumber \\ &\quad -\, \frac {1}{2} c^2 \left ( \rho _3-\rho _2 - \rho _3 \frac {H_3}{h_3} \right ) \, \zeta _2 + \frac {1}{2} (\rho _3-\rho _2) g \, \zeta _2^2 - \frac {1}{2} c^2 \rho _2 \frac {H_2}{h_2} (\zeta _2-\zeta _1). \end{align}

It can be shown that the system is equivalent to a Hamiltonian system with two degrees of freedom, for which $V$ is the potential. Furthermore, $V$ has been chosen to satisfy $V(0,0) =0$ , i.e. the origin lies on the zero energy level of the system.

The critical points of our dynamical system are precisely the critical points of $V$ . These are found as solutions of $\boldsymbol{\nabla }V (\zeta _1,\zeta _2)=(0,0)$ and can be classified by examining the Hessian matrix of $V$ , here denoted by $\mathcal{H}$ :

(4.4) \begin{align} {\mathcal{H}} = \left [ \begin{array}{cc} g (\rho _2-\rho _1) -\left ( \rho _1\frac {H_1^2}{h_1^3}+\rho _2 \frac {H_2^2}{h_2^3} \right ) c^2 & \rho _2 \frac {H_2^2}{h_2^3} c^2\\[7pt] \rho _2 \frac {H_2^2}{h_2^3} c^2 & g (\rho _3-\rho _2) - \left ( \rho _2 \frac {H_2^2}{h_2^3}+\rho _3 \frac {H_3^2}{h_3^3} \right ) c^2 \end{array} \right ]\!. \end{align}

Clearly, at the origin, $\boldsymbol{\nabla }V=0$ and ${\text{det}\,} {\mathcal{H}}$ is simply a multiple of the left-hand side of (2.12). As a result, $(0,0)$ is a non-degenerate critical point for the range of speeds ${\mathbb{R}}^{+} \setminus \{c_0^{\pm }\}$ . It becomes a saddle of $V$ when ${\text{det}\,} {\mathcal{H}}(0,0)\lt 0$ , i.e. when $c_0^{-}\lt c\lt c_0^{+}$ , and a local extremum otherwise. More precisely, (2.12) leads to the inequality

(4.5) \begin{equation} \left (c_0^{-}\right )^2 \lt \frac {g H_1 H_2 (\rho _2-\rho _1)}{\rho _1 H_2 + \rho _2 H_1}\lt \left (c_0^{+}\right )^2\!, \end{equation}

from which it follows that the origin is a local minimum (maximum) of $V$ in the range $]0,c_0^{-}[$ ( $]c_0^{+}, \infty [$ ).

4.1. Geometric locus of the critical points

Consider fixed values of $\rho _i$ and $H_i$ ( $i=1,2,3$ ). Then, for a given wave speed $c$ , the set of critical points is found by solving $ {\partial V}/{\partial \zeta _1}= {\partial V}/{\partial \zeta _2}=0$ . The admissible solutions are those within the triangular region $\mathcal{R}$ :

(4.6) \begin{equation} {\mathcal{R}} = \{ (\zeta _1,\zeta _2)\in {\mathbb{R}}^2: \, h_i\gt 0, \quad i=1,2,3\}. \end{equation}

Given that the potential $V$ is a rational function, finding its critical points is equivalent to finding the intersection of two plane algebraic curves:

(4.7) \begin{align} {\mathcal{C}}_1 & \equiv \frac {1}{2} c^2 \big [ (\rho _2-\rho _1) h_1^2 h_2^2 + \rho _1 H_1^2 h_2^2 - \rho _2 H_2^2 h_1^2 \big ] -(\rho _2-\rho _1)g \,\zeta _1 h_1^2 h_2^2 =0, \\[-12pt]\nonumber \end{align}
(4.8) \begin{align} {\mathcal{C}}_2 & \equiv \frac {1}{2} c^2 \big [ (\rho _3-\rho _2) h_2^2 h_3^2 + \rho _2 H_2^2 h_3^2 - \rho _3 H_3^2 h_2^2 \big ] -(\rho _3-\rho _2)g \,\zeta _2 h_2^2 h_3^2 =0. \end{align}

The fact that the origin remains a critical point for all values of $c$ can be interpreted geometrically by noting that both ${\mathcal{C}}_1 = 0$ and ${\mathcal{C}}_2 = 0$ are homogeneous curves. Eliminating $c$ from (4.7) and (4.8) yields the following relationship between $\zeta _1$ and $\zeta _2$ :

(4.9) \begin{equation} P(\zeta _1,\zeta _2) \equiv (\rho _2-\rho _1) \zeta _1 \,h_1^2 \,G_3(h_2,h_3) - (\rho _3-\rho _2) \zeta _2 \,h_3^2 \,G_1 (h_1,h_2) =0. \end{equation}

Here

(4.10) \begin{align} G_1(h_1,h_2) & =(\rho _2-\rho _1)h_1^2 h_2^2 + \rho _1 H_1^2 h_2^2 - \rho _2 H_2^2 h_1^2, \\[-12pt]\nonumber \end{align}
(4.11) \begin{align} G_3(h_2,h_3) &= (\rho _3-\rho _2)h_2^2 h_3^2 + \rho _2 H_2^2 h_3^2 - \rho _3 H_3^2 h_2^2. \end{align}

Within the admissible region $\mathcal{R}$ , the two sets of (4.7), (4.8) and (4.8), (4.9) are equivalent. In particular, any critical point of $V$ lies on the geometric locus defined by the curve $P=0$ in (4.9). Conversely, it can be shown (see Lemma1 of Appendix B) that for any point $(\zeta _1, \zeta _2)$ on the curve $P=0$ there exists a real value of $c$ for which (4.8), (4.9) are satisfied, and hence $(\zeta _1, \zeta _2)$ is a critical point of $V$ .

Figure 12. Plots for the curve $P=0$ in (4.9). We set $(H_1,H_2,H_3)=(3,1,6)$ and adopt the stratification (2.16) with $\varDelta _1=\delta \varDelta _2$ and $\varDelta _2=0.1$ . $(a)$ $\delta =1$ , $(b)$ $\delta \approx 1.70824$ , $(c)$ $\delta =2$ . When $c\approx 0$ , there are seven critical points of $V$ : three local maxima (M), three saddles (S), one minimum (m).

The curve $P=0$ can be readily visualised in the $(\zeta _1,\zeta _2)$ -plane once all physical parameters are specified, revealing two main distinct configurations; cf. panels $(a)$ and $(c)$ of figure 12. In either case, the curve $P=0$ consists of three distinct branches: branch I, which connects vertices $(-H_2-H_3,-H_3)$ and $(H_1, H_1+H_2)$ ; branch II, which connects the point $ ( -H_2/ (1+\delta ), H_2 \delta /(1+\delta ) )$ to one of the two points $(0,-H_3)$ or $(H_1,0)$ ; branch III, which connects $(H_1,-H_3)$ to either $(0,-H_3)$ or $(H_1,0)$ . All endpoints of these branches correspond to critical points in the limit when $c\rightarrow 0^+$ . As $c$ increases, all points (except the origin) move inward. Three maxima (M) depart from the vertices of the triangle $\mathcal{R}$ ; three saddles (S) depart from each side of the triangle; the origin remains a minimum (m) until $c$ reaches the value $c_0^-$ . These points move according to two basic rules: (i) if two points ever meet at any location other than the origin, they cannot pass each other; (ii) as $c$ increases, any given point on a branch of $P=0$ must remain on that same branch. The first rule follows from a simple observation: from (4.7), or (4.8), it is evident that each point on $P=0$ , apart from the origin, corresponds to a unique value of $c$ . The second rule is more subtle and requires a technical justification, which is provided in Appendix D.

The origin is a somewhat special point. It is the only critical point that will not change its position as the wave speed varies. Moreover, it is not a regular point of $P=0$ . In fact, at this double point (or crunode), the tangents to the curve $P=0$ can be found as $\zeta _2=\gamma ^\pm \,\zeta _1$ as in (2.13) (see Lemma2 in Appendix B).

For certain exceptional values of the physical parameters, the curve $P=0$ may exhibit an additional singular point. Assuming the stratification given in (2.16) with $\varDelta _1=\delta \varDelta _2$ , and fixing the parameters $\varDelta _2$ , $H_i$ ( $i=1,2,3$ ), the specific value of $\delta$ that gives rise to an extra crunode can be determined by solving the system $P=0$ , $ {\partial P}/{\partial \zeta _1}=0$ and $ {\partial P}/{\partial \zeta _2}=0$ , simultaneously. In the particular case depicted in figure 12, this critical value is approximately $\delta \approx 1.70824$ , as shown in panel $(b)$ . If, instead, the Boussinesq approximation were applied, this transition between the two curve configurations would occur precisely at $\delta =2$ (corresponding to $=H_3/H_1$ ), as demonstrated in Appendix B.

4.2. Collision of critical points

Consider fixed values of $\rho _i$ and $H_i$ ( $i=1,2,3$ ). As the wave speed varies, the surface of the potential $V$ is smoothly deformed and the number and nature of critical points remains invariant, unless a collision takes place. Examining when collisions occur is thus a crucial step in the critical point analysis.

4.2.1. Trivial collisions

We begin by examining the collision of critical points at the origin, which we refer to as ‘trivial’ collisions. It is important to note that the origin is a regular point of both curves ${\mathcal{C}}_1 = 0$ and ${\mathcal{C}}_2 = 0$ . Specifically, the tangents to each curve at the origin are given, respectively, by

(4.12) \begin{equation} \begin{aligned} \big [ (\rho _1 H_2 + \rho _2 H_1)c^2 - g(\rho _2-\rho _1)H_1 H_2 \big ] \zeta _1 - \rho _2 H_1 c^2 \,\zeta _2 &=0,\\ \rho _2 H_3 c^2 \,\zeta _1 - \big [(\rho _2 H_3 + \rho _3 H_2)c^2 - g(\rho _3-\rho _2)H_2 H_3 \big ] \zeta _2 &=0. \end{aligned} \end{equation}

Moreover, it can be shown that $ {\partial {\mathcal{C}}_k}/{\partial \zeta _2} \neq 0$ for any point on the curve ${\mathcal{C}}_k$ ( $k=1,2$ ). We can therefore use the following result (Coolidge, Theorem 4, pp. 16–17): in the vicinity of an ordinary finite point where the tangent is not vertical, $\zeta _2$ may be expressed as a convergent power series in terms of $\zeta _1-\zeta _{1,0}$ , where $\zeta _{1,0}$ is the abscissa of the point in question. For our purposes, we set $\zeta _{1,0}=0$ and seek expansions:

(4.13) \begin{equation} \zeta _2 = p_k \,\zeta _1 + q_k \,\zeta _1^2 + r_k \,\zeta _1^3 + O(\zeta _1^4), \quad k=1,2. \end{equation}

The coefficients $p_k$ , $q_k$ , $r_k$ ( $k=1,2$ ) are all given in (C4) of Appendix C. A double collision at the origin holds when $p_1=p_2$ , which is precisely when (2.12) is satisfied. In other words, this type of collision arises only at the linear long-wave speeds $c=c_0^{\pm }$ , at which the tangent lines given in (4.12) become identical.

For a triple collision at the origin to occur, not only the first- but also the second-order approximation to the curves must coincide, i.e.

(4.14) \begin{equation} p_1=p_2, \quad q_1=q_2. \end{equation}

It is shown in Appendix C that (4.14) holds precisely at criticality, i.e. when (2.23) is satisfied.

Lastly, a quadruple collision at the origin occurs when

(4.15) \begin{equation} p_1=p_2, \quad q_1=q_2, \quad r_1 = r_2. \end{equation}

As shown in Appendix C, this situation arises precisely when, at criticality, $T_1=0$ , with $T_1$ defined by (2.31). In other words, a quadruple collision at the origin occurs exactly when both the quadratic and cubic coefficients, $c_1$ and $c_3$ , respectively, of Gardner’s equation vanish simultaneously. To understand this, note that the coefficient $c_3$ in (2.30) vanishes when $T_1+T_2+T_3=0$ . Since $T_2$ and $T_3$ are both proportional to $c_1$ , we deduce that, at criticality (i.e. when $c_1=0$ ), the cubic nonlinearity coefficient can only vanish if $T_1=0$ . Another important consequence is that quadruple collisions can only occur at $c=c_0^+$ .

In figure 3 we can identify in panel $(b)$ three instances in the parameter space where a quadruple collision occurs. In all other panels, only one instance of this type of collision is possible. To gain a better understanding of how many such collisions can occur in the parameter space, it is helpful to resort to the Boussinesq approximation. Under this reduction, a quadruple collision is equivalent to

(4.16) \begin{equation} \tilde {T}_1=0, \quad H_1^2 H_2^2 \,\gamma ^3 + H_1^2 H_3^2 \,(1-\gamma )^3 - H_2^2 H_3^2=0, \quad \gamma ^2+ \varLambda \gamma - \delta =0, \end{equation}

with $\varLambda =\delta (1+H_2/H_3)-1-H_2/H_1$ . The first condition in (4.16) results from viewing $T_1=0$ as the vanishing of a multivariate polynomial, similarly to (C21), but under Boussinesq approximation. The second condition is equivalent to the criticality condition. The third results from imposing that the collision occurs at the linear long-wave speeds (see (B.4) in Barros et al. Reference Barros, Choi and Milewski2020). Solving this system simultaneously through use of the Gröbner basis (see, e.g. Cox et al. Reference Cox, Little, O’shea and Sweedler1997; Cox, Little & O’Shea Reference Cox, Little and O’Shea2005), we arrive at a condition (more specifically, an algebraic curve) involving only $\delta$ and $H_1/H_3$ . This is very convenient, since with one single plot a full description can be provided, as in figure 13. For values of $\delta$ close to 1, there will be three instances where a quadruple collision occurs, whereas in general only one quadruple collision is expected. Specifically, for $\delta =1$ , quadruple collisions occur for the following values of $H_1/H_3$ : $\approx 0.397536, \, 1.0, \, \approx 2.515$ . In figure 6, quadruple collisions correspond to intersections between the blue and purple curves. Quadruple collisions help define regions within the parameter space where distinct ISW properties arise, as illustrated in figure 11 and further demonstrated in figures 17 and 18.

Figure 13. Instances in the parameter space where a quadruple collision occurs at the origin. Here, the Boussinesq approximation is adopted and, for each value of $\delta =\delta _0$ , the number of quadruple collisions may be read from the plot by counting how many times the horizontal line $\delta =\delta _0$ intersects the graph.

4.2.2. Non-trivial collisions

Whenever two critical points coalesce, they form a degenerate critical point, which can be identified by the vanishing of the determinant of the Hessian matrix in (4.4) at that specific location. Let $(\zeta _1,\zeta _2)\neq (0,0)$ be the coordinates of such a point. Then,

(4.17) \begin{equation} {\text{det}\,} {\mathcal{H}} (\zeta _1,\zeta _2)=0 \end{equation}

is a condition that also depends on the value of speed $c$ . However, given that $(\zeta _1,\zeta _2)$ is a critical point, the corresponding value of $c$ can be easily determined from (4.8) and substituted back into (4.17). Eliminating denominators and simplifying the expression leads to a family of plane algebraic curves given by $F(\zeta _1,\zeta _2)=0$ , which depends on the parameters $\rho _i$ , $H_i$ ( $i=1,2,3$ ). Once these parameters are fixed, the coordinates $(\zeta _1,\zeta _2)$ corresponding to non-trivial collisions can be found as the intersection points of the two plane algebraic curves $P=0$ and $F=0$ . Figure 14 shows some demonstrative examples of the locations of non-trivial collisions. This figure will be revisited in § 5 to explain the bifurcation of panels $A$ $D$ in figure 7. Note that while the explicit expression for $F$ is unfortunately too cumbersome to be presented here, the results in figure 14 can be reproduced using any symbolic programme, such as Mathematica. In order to enable a more thorough investigation of collisions, we may employ the same methodology used for the conjugate states (see figure 8) to generate figure 15. This approach reveals not only the quadrants of the $(\zeta _1,\zeta _2)$ -plane in which collisions occur, but also the corresponding wave speeds at which these events take place. As illustrated, for fixed values of the density and undisturbed thickness of each layer, there exist either three or five non-trivial collisions.

Figure 14. Plots of the curve $P=0$ in (4.9) and points where non-trivial collisions occur. The stratification is given by (2.16), with $\varDelta _1= \varDelta _2=0.1$ , and the undisturbed thicknesses of each layer parameters are set as follows: $(a)$ $(H_1,H_2,H_3)=(2,6,1)$ , $(b)$ $(H_1,H_2,H_3)=(1,7,2)$ , $(c)$ $(H_1,H_2,H_3)=(3,1,6)$ , $(d)$ $(H_1,H_2,H_3)=(4,1,2)$ . The points in black (red) correspond to collisions leading to a decrease (increase) of the number of critical points. In each panel we overlay the zero contour of the potential $V$ , obtained for $(c/c_0^+)^2=1.01$ . The mode-1 ISWs for the physical parameters in panels $(a)$ $(d)$ resemble, respectively, those illustrated in panels $A$ $D$ of figure 7.

Figure 15. Collision curves on the $(H_2/H,c/\sqrt {gH})$ -plane for the stratification (2.16), with $\varDelta _1= \varDelta _2=0.1$ . $(a)$ $H_1/H_3=0.5$ , $(b)$ $H_1/H_3=2$ . The black dashed curves are the linear long-wave speeds, at which trivial collisions occur. Non-trivial collisions are represented by dotted (solid) lines when leading to a decrease (increase) of the number of critical points. We adopt the same colour scheme as in figure 8 to reveal the quadrant of the $(\zeta _1,\zeta _2)$ -plane where the collisions occur.

In general, a non-trivial collision leads to a decrease in the number of critical points. This is an immediate consequence of the first rule of how critical points evolve: since two points cannot move past one another, they must annihilate upon collision. This also poses a constraint on the nature of the critical points that collide. As a result of a theorem by Kronecker (see Theorem 2.4.6 in Emelyanov et al. Reference Emelyanov, Korovin, Bobylev and Bulatov2007), the total sum of topological indices of critical points is 1 and must remain invariant as the value of $c$ is increased (see Barros Reference Barros2016 for more details). Thus, no local maximum (or minimum) of the potential $V$ can collide with another point other than a saddle. In addition, after all collisions have taken place, only a single critical point remains: the origin, as a local maximum of $V$ . Let $c=c_{\!f}$ ( $\gt c_0^+$ ) denote the speed at which the final collision takes place. Beyond this speed, no solitary-wave solutions can exist. This critical speed thus serves as an upper bound for the propagation speed of mode-1 ISWs, as illustrated by the blue dotted curve in figure 15 $(a)$ , and it inherently limits their amplitude.

We now turn to the exceptional case in which, unexpectedly, two critical points emerge spontaneously. These occurrences are indicated by red markers in figure 14 and by solid lines in figure 15. Panels $(b)$ and $(c)$ of figure 14 are related to figure 15 $(a)$ , since in both cases $H_1/H_3=0.5$ . Similarly, panels $(a)$ and $(d)$ of figure 14 are related to figure 15 $(b)$ , since in both cases $H_1/H_3=2$ . From figure 15, emergent collisions are to be expected for large values of $H_2/H$ , which explains the contrast between the bottom and top panels of figure 14. While `typical’ collisions – where critical points annihilate – occur along all branches of the curve $P=0$ , our extensive numerical investigations (see figures 15 and 16) indicate that such emergent collisions occur exclusively along branch I. From this, we conclude that these events correspond to the simultaneous emergence of a saddle point and a local maximum. In the two cases considered in figure 15, such occurrences are observed only for speeds below $c_0^+$ . However, as illustrated in figure 16, emergent collisions can also arise at speeds exceeding $c_0^+$ .

Figure 16. In panel $(a)$ , similar to figure 15, collision curves are displayed on the $(H_2/H,c/\sqrt {gH})$ -plane for the parameters $\varDelta _1=0.2$ , $\varDelta _2=0.1$ and $H_1/H_3=10$ . The two other panels illustrate distinct situations in panel $(a)$ , obtained with different values of $H_2/H$ : $(b)$ $H_2/H=0.28$ , $(c)$ $H_2/H=0.6$ . In both plots we set $H_1=1$ and overlay the zero contour of the potential $V$ , obtained for $(c/c_0^+)^2=1.01$ .

5. The strongly nonlinear solution space: distinguishing cases $\boldsymbol{A - F}$

In this section we present a way to distinguish the characteristics of the solution space for mode-1 solitary waves for the MMCC3 system by tracking the behaviour of the critical points of the potential. From a dynamical systems perspective, mode-1 ISWs for the MMCC3 model arise as trajectories departing from the local maximum at the origin towards a saddle located on one or both sides of branch I of $P=0$ . The return point of such trajectories is a point at which $V=0$ .

Let us revisit figure 14. The top panels show five non-trivial collisions of critical points, with one highlighted in red to indicate an emergent collision, whereas the lower panels display only three such collisions. Also depicted are the zero contours of $V$ at $c=c_0^+(1+\epsilon)$ ( $\epsilon \ll 1$ ), which appear as simple closed loops superimposed on the plots of $P=0$ . Each loop contains a special point that serves as a return point for the trajectory of a mode-1 ISW.

Assuming mode-1 ISWs exist for given physical parameters and no emergent collisions occur as c increases, the scenario corresponds to that shown in the lower panels of figure 14, where only a single loop appears, indicating that solutions of opposite polarity do not coexist. The figure further reveals that mode-1 solutions may be of depression (panel c), or elevation (panel d), depending on the quadrant in which the loop resides. The polarity predictions from the strongly nonlinear theory are in full agreement with those of the KdV theory, as shown in Appendix D. According to figures 15 and 16, this behaviour is expected when the ratio $H_2/H$ is sufficiently small. In this regime, the solutions resemble those illustrated in panels $C$ and $D$ of figure 7. It is important to note, however, that the behaviour in panels $C$ and $D$ can also be observed in other regimes, as shown below.

In what follows, we establish the existence of conjugate states of opposite polarity with $c\gt c_0^+$ as the criteria for the coexistence of solutions of opposite polarity (panels A and B), as found by Lamb (Reference Lamb2023) for a smooth double-pycnocline stratification. We then distinguish between the panels that do not exhibit multiplicity of solutions (panels CF). Finally, we sweep the parameter space in the Boussinesq case, presenting our findings for varying $\delta$ on the $(H_1/H,H_2/H)$ space.

5.1. Criteria for the coexistence of ISWs of opposite polarity (panels A and B)

Based on the discussion above, we conclude that an emergent collision is a necessary condition for the coexistence of mode-1 waves with opposite polarities. This is exemplified by the physical parameters corresponding to the top panels of figure 14. However, as illustrated in figures 16 $(b)$ and 16 $(c)$ , the presence of an emergent collision alone is not sufficient: solutions of opposite polarity fail to materialise in both cases. To guarantee the coexistence of solutions of opposite polarity, we require that, at $c=c_0^+$ , two local maxima of $V$ exist on either side of branch I, with both satisfying $V\gt 0$ . In light of how critical points evolve, this implies that (i) an emergent collision (yielding a saddle and a maximum) must occur at $c\lt c_0^+$ , (ii) the resulting maximum must reach the origin as $c$ approaches $c_0^+$ . Furthermore, the polarity of the solution branch bifurcating from zero amplitude at the linear long-wave speed $c=c_0^+$ is determined by the quadrant of the $(\zeta _1,\zeta _2)$ -plane where the emergent collision takes place. Specifically, in figure 14 $(a)$ the emergent collision occurs in the first quadrant, resulting in a depression branch bifurcating from zero amplitude, while the elevation branch bifurcates from finite amplitude. A similar situation, but with the roles of the elevation and depression branch reversed, can be observed in figure 14 $(b)$ , where the emergent collision occurs in the third quadrant. In the regimes considered, the solutions resemble those illustrated in panels $A$ and $B$ of figure 7, respectively.

Identifying the parameter region for the coexistence of solutions with opposite polarity is quite challenging when relying solely on the analysis of collisions. Based on figure 15, one might be inclined to think that the desired feature holds for large values of $H_2/H$ , since collision curves in orange and blue exist for $c\gt c_0^+$ (following an emergent collision at $c\lt c_0^+$ ). This is a clear indication that, at $c=c_0^+$ , $V$ has two maxima on either side of branch I, which eventually collide with saddle points as $c$ increases. However, this approach does not a priori provide a guarantee of the desired outcome, as we cannot confirm if, originally at $c=c_0^+$ , $V\gt 0$ at the two maxima. This can be overcome by observing that, for that to happen, the two loops on either side of branch I should both shrink to a point, as $c$ increases from the value $c=c_0^+$ . When that happens (for each loop), a maximum becomes an equilibrium at the same energy level as the origin, i.e. $V= {\partial V}/{\partial \zeta _1}= {\partial V}/{\partial \zeta _2}=0$ , and thus, a conjugate state is reached and (3.1) is recovered. Therefore, if conjugate states of opposite polarity coexist for $c \gt c_0^+$ , the presence of ISWs with opposite polarity is assured. This finding supports the validity of the criterion proposed by Lamb (Reference Lamb2023) (see § 3.1), at least within the scope of the strongly nonlinear theory.

5.2. Criteria to distinguish panels C–F

The above highlights the benefits of using a conjugate state analysis to identify specific behaviours of solutions. Accordingly, the criteria presented hereafter are based on the properties of the conjugate states for this physical system.

Assume, for a given set of parameters, that conjugate states of opposite polarity, both with $c\gt c_0^+$ , do not exist. In this case, extensive numerical tests reveal that, for all $c\gt c_0^+$ , all critical points of the potential $V$ along branch I lie in the first (third) quadrant if $c_1\gt 0$ ( $c_1\lt 0$ ). Hence, the KdV theory correctly predicts the polarity of the solitary waves in panels $C$ $F$ . We note that it is also possible, for the chosen parameters, that a conjugate state exists on branch II. As a result, in addition to the waves discussed above, there are other solutions exhibiting mode-2 characteristics, which will be addressed separately in § 5.3.

In terms of conjugate states for the system, a few cases must be considered. The simplest case is that there is only one mode-1 conjugate state with $c\gt c_0^+$ . In such cases, there exists a single solution branch of mode-1 solitary waves (panel $C$ or $D$ ), starting from zero amplitude and limiting to a tabletop solitary wave.

The case when there are two or three mode-1 conjugate states with $c\gt c_0^+$ in the first or third quadrant requires more delicate consideration. Without loss of generality, consider the case where two or three mode-1 conjugate states with $c\gt c_0^+$ exist in the first quadrant. For this to occur, we must have $c_1\gt 0$ , and the emergent collision of critical points must have occurred in the first quadrant along branch I, such as observed in figure 16(b). Note that this emergent collision occurs in between the origin and the maximum departing from $(H_1,H_1+H_2)$ in the limit when $c\rightarrow 0^+$ , since each point along the curves $P=0$ corresponds to a unique value of $c$ . The criterion for distinguishing whether we have one branch of solutions (panel $D$ ) or two branches (panel $F$ ) is as follows. As long as the speed at which the emergent maximum becomes a conjugate state is less than the speed at which the other maximum does, we have panel $F$ . Otherwise, we have panel $D$ . Analogous consideration applies to the case when two or three mode-1 conjugate states with $c\gt c_0^+$ exist in the third quadrant. This is shown, for example, in figure 10. At the black square, where the emergent collision occurs at $V=0$ with $c\gt c_0^+$ , the speed of the emergent maximum conjugate state (given by the black square) is less than the other maximum (the blue curve above the black square). However, as $H_2$ increases, the two front curves cross at the grey square. Hence, past the grey square, the speed of the emergent maximum as a conjugate state is faster than that of the other maximum. We see here that the black square and grey square signal changes in behaviour, while the crossing of the front curves through $c=c_0^+$ (blue circle) in this case do not. Greater detail on the solution space for panels $E$ and $F$ is given in § 6. Next, we use the above criteria to present the Boussinesq solution space for all parameters.

Figure 17. Partition of the (Boussinesq) parameter space into regions associated with distinct solution types. The dotted curves in each panel are the instances in the parameter space at which the front curve crosses the curve $c=c_0^+$ . The blue curve corresponds to criticality, at which the front curve and the curve $c=c_0^+$ are tangent. The other markers in the panels (when applicable) are the black and grey squares discussed in figure 11 $(b)$ . The red curve represents the instance when the front curve bearing characteristics of mode-2 crosses the curve $c=c_0^+$ . In the region above this red line, not only coexistence of ISWs of opposite polarity exist, but also large amplitude solutions with mode-2 characteristics. The panels show results for different values of $\delta$ . The colour scheme used here is outlined in figure 7. The purple lines represent the condition $c_3=0$ , included for comparison with predictions from Gardner theory for $\zeta _1$ .

Figure 18. Same as in figure 17, but for $\delta \gt 1$ .

5.3. Parameter sweep of the Boussinesq solution space

The above provides a mechanism for determining the solution space from the behaviour of the conjugate states. Figures 17 and 18 demonstrate how the solution space evolves as one varies $\delta$ within the Boussinesq regime. The colour scheme corresponds to panels AF in figure 7.

Plots such as figure 8 show conjugate states as a function of $H_2$ for given $\delta$ and varying $H_1/H_3$ , where each value of $H_1/H_3$ represents a ray in the parameter space, as shown in figure 6. To create figures 17 and 18, a representative set of rays spanning the parameter space is selected. From each ray, key markers are extracted as illustrated in figure 10, including blue circles (mode-1 conjugate state with $c=c_0^+$ ), black squares (emergent collision coinciding with being a conjugate state), grey squares (two conjugate states in the same quadrant having the same speed) and the red curve (mode-2 conjugate state with $c=c_0^+$ ). All of these points are recovered numerically. The plots also display blue and purple lines, which are obtained analytically. As can be seen from figures 8 and 10, each point of tangency between conjugate curves and $c=c_0^+$ is marked by a colour change (from blue to orange or vice versa). Within the MMCC3 theory, this means that tangencies correspond to a triple collision at the origin, which holds at criticality – specifically when $c_1=0$ in the KdV equation (see Appendix C) – and is depicted as a blue line. The purple lines represent the condition $c_3=0$ , included for comparison with predictions from Gardner theory.

For small values of $H_2$ , the solution space always corresponds to either panel $C$ or $D$ , depending on the sign of $c_1$ . As $H_2$ increases, various events – such as the emergence of critical points, the crossing of front curves, their intersection with the curve $c=c_0^+$ and criticality – can significantly affect the structure of the solution space. Figures 17 and 18 capture all the key topological transitions observed in the solution space. More precisely, between two consecutive panels (i.e. successive values of $\delta$ ) the structure of the solution space remains qualitatively similar to that corresponding to the lower value of $\delta$ . It is important to note that, although not indicated in the colour scheme, for any point in parameter space lying above the red curve, along with the solutions described by panel $A$ or $B$ we also have solutions with characteristics of mode-2 corresponding to panel $M2$ in figure 7 (see Doak et al. Reference Doak, Barros and Milewski2022 for further details).

Finally, it is worth noting that any coloured panel corresponding to $\delta \gt 1$ can be transformed into one with $1/\delta$ via the mapping $(x,y)\mapsto (1-x-y,y)$ , with the caveat that the colour scheme is reversed. This reflects the upside-down symmetry inherent in both the Euler and MMCC3 systems, and also highlights the fact that any behaviour exhibited by one interface is mirrored by the other.

6. Numerical solutions for the MMCC3 and Euler systems under the Boussinesq approximation

In this section we adopt the Boussinesq approximation and compare the numerical solutions of the corresponding MMCC3 and Euler systems. Throughout, we choose mass, length and time scales such that $\rho _0=1$ , $H=1$ and $g=1$ . In addition, when referring to the potential associated with the MMCC3 system under the Boussinesq approximation, we mean the potential $V_B$ defined in (B6); for brevity, however, we denote it simply as $V$ . We truncate the infinite domain to a finite interval $[-L,L]$ , ensuring $L$ is sufficiently large such that the numerical solution is invariant to further increase in $L$ . The MMCC3 system is solved using a second-order central difference finite-difference scheme. The three-layer Euler solutions are recovered using the numerical method presented in Guan et al. (Reference Guan, Doak, Milewski and Vanden-Broeck2024), based on a non-local representation of the equations that arise from conformal mappings and boundary integral formulae. All solutions are assumed to be symmetric about $x=0$ , halving the domain over which one has to the numerically solve the system.

We first describe in detail the bifurcation of the solutions corresponding to parameters that gives panels E and F. We choose the parameters $\delta =0.5$ , $\varDelta _2=0.01$ , $H_1/H_3=0.1$ and $H_2=0.3$ . For such parameters, there are three mode-1 conjugate states in the third quadrant of the $(\zeta _1,\zeta _2)$ -plane (see figure 10). The fastest two of these are maxima of the potential $V$ and have speeds $c_2^*\approx 1.059c_0^+$ and $c_1^*\approx 1.042c_0^+$ . The third (slowest) conjugate state corresponds to a saddle of the potential, for which there does not exist any heteroclinic orbits connecting this critical point to the origin. At what speed this critical point is a conjugate state tells us nothing about the solution space.

Figure 19. Mode-1 solitary wave solution branches for the MMCC3 (black solid) and Euler (red dashed) systems with $\delta =0.5$ , $\varDelta _2=0.01$ , $H_2=0.3$ and $H_1/H_3=0.1$ . There are two solution branches, the first bifurcating from $c=c_0^+$ and gradually evolving into a tabletop solitary wave with a limiting speed $c=c_1^*\approx 1.042c_0^+$ . The second branch has speeds $c\gt c_1^*$ . The solutions with speeds close to $c_1^*$ have a tabletop solitary wave-like structure, with an additional hump about $x=0$ . As the speeds increase, the volume of the wave decreases, before again increasing to another tabletop solitary wave with a limiting speed $c=c_2^*\approx 1.059c_0^+$ . The profiles of the solutions (a)( f) marked by black crosses (MMCC3) and red circles (Euler) are illustrated in figure 20. The black dashed vertical lines show the values $c=c_1^*$ and $c=c_2^*$ , to which the solution branches asymptote.

Figure 20. Solutions (a)–( f) from figure 19 for the MMCC3 (black lines) and Euler (red circles, every fifteenth mesh point plotted) systems. When comparing the profiles, we chose solutions such that they have the same volume $Q$ . Solutions (a) and (b) are on the first solution branch, with (c)–( f) being on the second. The speeds of the MMCC3 solutions (a)–( f) are (to four decimal places) $c/c_0^+=1.0380,1.0418, 1.0418,1.0427,1.0529,1.0588$ , while for Euler solutions, they are the same to that accuracy except for solution $(d)$ that has $c/c_0^+=1.0427$ and $(e)$ with $c/c_0^+=1.0532$ .

Figure 21. Trajectories (in red) of solutions (a)–( f) from figure 19 for the MMCC3 model, when projected onto the $(\zeta _1,\zeta _2)$ -plane. The dashed lines represent the coordinate axes. The blue curve is $P=0$ , the black curves are the equipotential $V=0$ and the green (black) dots are saddles (maxima) of $V$ .

Figure 22. Each panel shows the upper interface $\zeta _1(x)$ of a solution of the MMCC3 (black curves) and full Euler (red circles) system. All solutions have $\delta =0.5$ and $\varDelta _2=0.01$ . The total depth is normalised to $H_1+H_2+H_3=1$ , and the undisturbed heights $(H_1,H_2)$ for each panel are $(a)$ $(0.35,0.6)$ , $(b)$ $(0.1,0.8)$ , $(c)$ $(0.2,0.1)$ , $(d)$ $(0.3,0.5)$ , $(e)$ $(0.0636,0.3)$ , $(f)$ $(0.5714,0.4)$ . The parameters for $H_i$ were chosen such that each panel (af) corresponds to the panels AF from figure 7. For a given speed, shown above the panel, there exists either one solution, as in panels (cf), or two solutions, as in panels (ab).

Numerical results reveal there are two solution branches, one with $c\in [c_0^+,c_1^*]$ and the other with $c\in (c_1^*, c_2^*]$ . These are shown in figure 19, where the horizontal axis is the speed and the vertical axis is a measure of the volume of the wave $Q$ , given by

(6.1) \begin{align} Q = \int _0^L |\zeta _1| + |\zeta _2| \, {\textrm{d}}x. \end{align}

We integrate over only half the domain in $x$ since symmetry is assumed. The profiles corresponding to solutions (a)–( f) are shown in figure 20. The first branch bifurcates from zero amplitude and limits to a heteroclinic orbit between the origin and the conjugate state with speed $c=c_1^*$ . Solutions $(a)$ and $(b)$ show this behaviour. To describe the second solution branch, we first highlight the solution in panel $(c)$ . This solution has a speed slightly greater than $c_1^*$ . The solution is similar to the tabletop solitary wave from the first branch (panel b), but is characterised by an additional localised hump along the broadened section. As with tabletop solitary waves, one can compute solutions with arbitrarily large broadened sections as $c\rightarrow c_1^*$ , but for the second branch, the limit is from above. Due to the hump along the broadened section, this solution cannot limit to a heteroclinic orbit, and hence, must exist for speeds strictly greater than $c_1^*$ , unlike the first branch, where the limiting solution is a heteroclinic orbit with $c=c_1^*$ . Following the branch the other way, for a small increase in speed, the broadened section decreases in volume, resulting in a more typical solitary wave profile (solution d). The solution branch then increases in amplitude (solution e), before limiting to a heteroclinic orbit for the second faster conjugate state given by $c=c_2^*$ (solution f).

To better understand this limiting behaviour of the second branch near $c=c_1^*$ , it is helpful to consider the critical points of the potential $V$ . In figure 21 we plot the trajectories of the MMCC3 solutions (a)–(f) projected onto the $(\zeta _1,\zeta _2)$ -plane, shown in this figure as a red curve. We also show for these solutions the critical points of $V$ , with a black and green circle representing maxima and saddles, respectively. Finally, we show the curve $P=0$ (blue, see (4.9)) and the contour $V=0$ (black). Solution $(a)$ is a moderate amplitude solution from the first solution branch. As can be seen, the solution is a homoclinic orbit originating at the origin and reaching the $V=0$ contour, before returning. The solution $(b)$ is a tabletop solitary wave along the first branch. Following the branch further, the solution limits to a heteroclinic orbit, connecting the origin to the non-trivial maximum closer to the origin as $c\rightarrow c_1^*$ from below. For speeds just greater than $c_1^*$ , we have solutions on the second branch, such as solution $(c)$ . This solution has a broadened shelf region, where the amplitude of the shelf is predicted by the conjugate state with speed $c=c_1^*$ . However, since the speed is just greater than $c_1^*$ , the maximum corresponding to that conjugate state has $V\lt 0$ . It can be seen that the solution also has a hump at $x=0$ , giving the profile in panel $(c)$ , a close resemblance to those shown in figure 4 by Dunphy et al. (Reference Dunphy, Subich and Stastna2011) and figure 5 by Makarenko et al. (Reference Makarenko, Maltseva and Kazakov2009). As $c\rightarrow c_1^*$ from above, the broadened section becomes wider and the limiting solution is a near-heteroclinic orbit connecting the origin to the conjugate state, and then a near-homoclinic orbit originating from that same conjugate state. Heuristically, the non-trivial maximum closer to the origin, which is a conjugate state at $c=c_1^*$ , blocks the trajectory from the origin in to the second valley where the additional bump lies for speeds $c\leqslant c_1^*$ . This explains why the two solution branches exist for different ranges of speeds. Following the branch further, as $c$ increases, the broadened shelf shrinks and the solution develops a pyramidal structure, as shown in panel $(d)$ , resembling figure 3 by Derzho (Reference Derzho2017). As the speed increases further, the solution limits to a heteroclinic orbit connecting the origin with the second maximum (solution f).

As mentioned earlier, in terms of partitioning the parameter space into regions associated with distinct types of solutions, the procedure should remain valid within both the MMCC3 and Euler theories. The reason being that all criteria proposed to distinguish the different regions rely on the shared conjugate state structure between the MMCC3 model and the full Euler equations. What remains to be confirmed is that the solution behaviour predicted by the critical point analysis for the MMCC3 model, for each one of those regions, is well reproduced by the solutions to the Euler equations. The numerical results presented above for solutions of the type shown in panels E and F are certainly encouraging. Keeping the same values $\delta =0.5$ , $\varDelta _2=0.01$ , we now choose the values of $H_i$ such that the six regions in figure 11 are covered. Figure 22 shows representative solutions from each panel AF for the Euler (red circles) and MMCC3 (black curves) theories. We show only the upper interface displacement $\zeta _1$ , as for each solution, the lower interface has a comparable shape. For each panel, we fix the speed and show the corresponding solutions. Note that panels $(a)$ and $(b)$ show two solutions, which is only possible since they are in the parameter regime for which there is true multiplicity of solutions. While the quantitative agreement between the MMCC3 and Euler solutions shown here is excellent, we cannot guarantee that this persists across all parameter values. Nonetheless, these computations provide evidence that the bifurcation structure, associated with the existence of conjugate states, is consistent for both MMCC3 and Euler.

We conclude the section by showing that solutions with multiscale structure, cf. panels E and F, persist in the following double-pycnocline stratification:

(6.2) \begin{align} \rho (z) = 1+\frac {\varDelta _2}{2} - \frac {\varDelta _1}{2} - \frac {\varDelta _2}{2}{\text{tanh}\,}{\left ( (z-H_3)/d \right )} - \frac {\varDelta _1}{2}{\text{tanh}\,}{\left ( (z-H_3-H_2)/d \right )}. \end{align}

For this continuous stratification, the steady-state solutions of the Euler equations must satisfy the DJL equation (see Stastna Reference Stastna2022). The parameter $d$ in (6.2) determines the thicknesses of the two pycnoclines; smaller values of $d$ correspond to thinner pycnoclines. Here, we consider small values of $d$ and small density increments ( $d=1/80=0.0125$ , $\varDelta _1=0.005$ , $\varDelta _2=0.01$ ) to enable a meaningful comparison with a three-layer fluid system, as illustrated in figure 23 $(b)$ , under the Boussinesq approximation. We set $H_2=0.3$ and $H_3=7/11 \approx 0.63636$ , or equivalently $H_1/H_3=0.1$ , and compare the solution $(c)$ from figure 19 for the MMCC3 model with an akin solution for the DJL equation. The DJL solution has speed $c=0.0500$ , while the MMCC3 solution has speed $c=0.0505$ . As can be seen from figure 23 $(a)$ , the agreement between the two interfaces of the layered model and the pycnoclines in the continuous stratification is excellent. The DJL solution was computed using a finite-difference method, similar to the one used by Tung et al. (Reference Tung, Chan and Kubota1982). We note that, although the profiles shown in figure 23 resemble those in figure 4 of Dunphy et al. (Reference Dunphy, Subich and Stastna2011), which were also computed for this type of stratification, the latter solutions include a background shear current in the far field.

Figure 23. In panel $(a)$ the black circles show solution $(c)$ from figure 19 for the MMCC3 model. The colourmap shows the density field and the curves of the streamlines of a comparable DJL solution, where the undisturbed density in the far field is given by (6.2). The parameters are set as $d=1/80$ , $\varDelta _1=0.005$ , $\varDelta _2=0.01$ , $H_2=0.3$ and $H_3=7/11$ for which the DJL solution obtained has speed $c=0.0500$ , while the MMCC3 solution has speed $c=0.0505$ . Panel $(b)$ shows the undisturbed density in the far field for the DJL (blue) model, approximated by the piecewise-constant stratification for the the MMCC3 (black) model.

7. Concluding remarks

In this paper we present a classification of mode-1 ISWs in a three-layer fluid. The results contribute to a deeper understanding of the various types of solitary waves that can arise in this physical system. More significantly, criteria are proposed for delineating regions within the parameter space associated with the distinct solution types. This is illustrated in figures 17 and 18, which provide a complete treatment of the problem under the Boussinesq approximation. The figures show how the structure of the solution space changes with different values of $\delta$ . It is evident that, irrespective of the specific value of $\delta$ , when the intermediate layer is sufficiently thin (approximately $H_2/H \lt 0.2$ ), the solitary waves resemble those shown in panels $C$ or $D$ , depending on the values of $H_1$ and $H_3$ . In contrast, for thicker intermediate layers, ISWs exhibits a much richer and complex behaviour. Nevertheless, it seems that a significant part of the parameter region yields solutions similar to those shown in panels AD, and often the regions associated with panels $E$ and $F$ are rather limited. This may help explain why these types of solutions have remained unnoticed in earlier investigations.

The criteria proposed to partition the parameter space into regions associated with distinct types of solutions relies on the shared conjugate state structure between the MMCC3 and Euler equations. As such, we expect the approach to be valid under both theoretical frameworks. What is particularly striking, however, is that the behaviour of solutions based on the critical point analysis for the MMCC3 model is also exhibited by solutions of the Euler equations, e.g. the behaviour illustrated in figure 21 for a solution of the type shown in panel $E$ . In that case, we have demonstrated that the maximum closer to the origin effectively blocks the trajectory from entering orbits that would otherwise approach the loop surrounding the other maximum. This scenario changes when the maximum closer to the origin drops below the threshold where the potential $V \lt 0$ . Remarkably, the same dynamics is observed in both systems. While the correspondence between their critical points is formally established only when they represent conjugate states, these findings suggest that there may be deeper connections between the critical points of the MMCC3 and Euler equations.

Despite the relatively small size of the region associated with panel $E$ in figure 17, we were able to find comparable solution behaviour in the corresponding continuous double-pycnocline stratification (6.2). It would be interesting to explore how variations in the parameter $d$ for the pycnocline thickness affect the structure and extent of the different regions in the parameter space.

Acknowledgements

The authors would like to thank K. Khusnutdinova for helpful discussions and for bringing the paper by Maltseva (Reference Maltseva1989) to their attention. We sincerely thank the anonymous referees for their insightful comments and for bringing to our attention that pyramidal solitons had previously been reported in the literature.

Funding

R.B and P.A.M. would like to acknowledge funding from Leverhulme Trust (RPG-2023-264). A.D. would like to acknowledge funding from EPSRC NFFDy Fellowships (EPSRC grants EP/X028607/1). W.C. was supported by the US National Science Foundation (Grant No. DMS-2108524). The authors would like to thank the Isaac Newton Institute for Mathematical Sciences, Cambridge, for support and hospitality during the programme `Emergent phenomena in nonlinear dispersive waves’, where part of the work on this paper was undertaken. This work was supported by EPSRC grant EP/R014604/1.

Declaration of interests

The authors report no conflict of interest.

Appendix A. Derivation of the Gardner equation

The Gardner equation (2.29) can be obtained using the method presented, e.g. by Choi & Camassa (Reference Choi and Camassa1999), but the derivation is lengthy and tedious. Here we sketch the derivation of the nonlinear terms in the Gardner equation without the linear dispersive term, $\zeta _{1,xxx}$ , which was previously obtained for the KdV equation (Barros et al. Reference Barros, Choi and Milewski2020).

The three-layer (non-dispersive) hydrostatic model that neglects any dispersive effects can be found by disregarding the right-hand sides of the MMCC equations, equations (2.1) and (2.2) in Barros et al. (Reference Barros, Choi and Milewski2020). In a frame of reference moving with the linear long-wave speed $c_0$ , the hydrostatic model can be written, for the local layer thicknesses $h_i(x,t)$ , thickness-averaged velocities $\overline {u}_i$ and the hydrostatic pressures $P_i$ , as

(A1) \begin{equation} h_{i,t}-c_0 h_{i,x}+ (h_i \overline {u}_i)_x=0, \!\quad\! \overline {u}_{i,t}-c_0 \overline {u}_{i,x}+\overline {u}_i \overline {u}_{i,x}+g \eta _{i,x}+{P_{i,x}\over \rho _i} =0 \!\quad\! \mbox{for} \!\quad\! i=1,2,3, \end{equation}

where $h_i$ and $\eta _i$ are defined in (2.1) and figure 1. In (A1), $P_1$ is the unknown pressure on the top rigid surface while $P_2$ and $P_3$ , the pressures at the upper and lower interfaces, respectively, are given by $P_{i+1}=P_i+\rho _i gh_i$ for $i=1,2$ .

After non-dimensionalising all variables with respect to $g$ and $H$ (or equivalently, setting $g=H=1$ ), we expand $w= (\zeta _i, \, u_i,\, P_1)$ for a weakly nonlinear wave as

(A2) \begin{equation} w= \alpha w^{(1)}+\alpha ^2\, w^{(2)}+\alpha ^3\, w^{(3)}+{\cdots} , \end{equation}

where $\alpha$ is a small nonlinear parameter given by $\alpha =a/H\ll 1$ with $a$ being the wave amplitude. After substituting (A2) into (A1) and introducing slow space and time variables given by $\xi =\alpha x$ and $\tau _n=\alpha ^n t$ $(n=1,2,{\cdots} )$ , respectively, one can find equations, e.g. for $\zeta _1^{(n)}$ valid at $O(\alpha ^n)$ .

At $O(\alpha )$ , after expressing $u_i$ and $P_1$ in terms of $\zeta _i^{(1)}$ as

(A3) \begin{equation} u_1^{(1)}={c_0\over H_1}\,\zeta _1^{(1)},\ \ u_2^{(1)}={c_0\over H_2}\,\big(\zeta _1^{(1)}-\zeta _2^{(1)}\big),\ \ u_3^{(1)}={c_0\over H_3}\,\zeta _2^{(1)},\ \ P_1^{(1)}=-{\rho _1c_0^2\over H_1}\,\zeta _1^{(1)}, \end{equation}

a linear system for ${\boldsymbol \zeta ^{(1)}}= (\zeta _1^{(1)}, \zeta _2^{(1)} )^T$ can be obtained as

(A4) \begin{equation} \boldsymbol {A} {\boldsymbol \zeta }^{(n)} =0, \qquad \boldsymbol {A}= \left ( \begin{array}{c c} a_{11} & a_{12} \\[2pt] a_{21} & a_{22} \\[2pt] \end{array} \right )\! , \end{equation}

where $a_{jl}$ ( $j,l=1,2$ ) are given, in dimensional form, by

(A5) \begin{align} a_{11} & = - ( {\rho _1}{H_2} + {\rho _2} {H_1} )\,c_0^2 + g\,( {\rho _2}- {\rho _1}) {H_1} {H_2} , \qquad a_{12}= {\rho _2}\, {H_1}c_0^2, \\[-12pt]\nonumber \end{align}
(A6) \begin{align} a_{21} & =-\rho _1 H_3\,c_0^2 +g\, ({\rho _2}- {\rho _1}) {H_1} {H_3}, \qquad a_{22}= - \rho _3 {H_1}\, c_0^2+ g \,( {\rho _3}- {\rho _2}) {H_1} {H_3} . \end{align}

The fact that the determinant of $\boldsymbol {A}$ given by $a_{11}a_{22}-a_{12}a_{21}$ must vanish for a non-trivial solution for $\boldsymbol \zeta ^{(1)}$ yields the linear dispersion relation for $c_0$ . To find the form in (2.12), it should be noted that

(A7) \begin{equation} \begin{aligned} a_{21} & = {H_3\over H_2}a_{11}+ {\rho _2H_1H_3\over H_2}\,c_0^2, \\ a_{22} & = {H_1\over H_2} \big [ -( {\rho _2}{H_3} + {\rho _3} {H_2} )\,c_0^2 + g\,( {\rho _3}- {\rho _2}) {H_2} {H_3} \big ]+\rho _2{H_1H_3\over H_2}c_0^2. \end{aligned} \end{equation}

At $O(\alpha ^n)$ ( $n=2,3$ ), we have $\boldsymbol {A} {\boldsymbol \zeta }^{(n)} =\boldsymbol f^{(n)}$ with $\boldsymbol f^{(n)}= (f_1^{(n)}, f_2^{(n)} )^T$ , where $f_i^{(2)}$ and $f_i^{(3)}$ ( $i=1,2$ ) are given by

(A8) \begin{align} f_i^{(2)} &= \beta _{i1}^{(2)} \ \zeta _{1,\tau _1}^{(1)}+ \beta _{i2}^{(2)} \ \zeta _{1}^{(1)}\,\zeta _{1,\xi }^{(1)}, \end{align}
(A9) \begin{align} f_i^{(3)} &= \beta _{i1}^{(3)} \ \zeta _{1,\tau _2}^{(1)}+ \beta _{i2}^{(3)} \ \zeta _{1,\tau _1}^{(2)}+ \beta _{i3}^{(3)} \ \zeta _{1}^{(1)}\,\zeta _{1,\xi }^{(2)} + \beta _{i4}^{(3)}\ \zeta _{1}^{(2)}\,\zeta _{1,\xi }^{(1)}+ \beta _{i5}^{(3)} \ {\zeta _{1}^{(1)}}^2\,\zeta _{1,\xi }^{(1)}. \end{align}

Here the expressions for $\beta _{ij}^{(n)}$ are too long and are omitted. Then, by imposing the solvability condition for the non-homogenous system $\boldsymbol {A} {\boldsymbol \zeta }^{(n)} =\boldsymbol f^{(n)}$ given by

(A10) \begin{equation} a_{22} f_1^{(n)}-a_{12} f_2^{(n)} = 0 \quad \mbox{or} \quad a_{21} f_1^{(n)}-a_{11} f_2^{(n)} = 0,\end{equation}

one can obtain the following evolution equations for $\zeta _1^{(n)}$ for $n=1$ and 2 at $O(\alpha ^2)$ and $O(\alpha ^3)$ , respectively:

(A11) \begin{align} \zeta _{1,\tau _1}^{(1)}+ c_1 \zeta _{1}^{(1)} \zeta _{1,\xi }^{(1)} & =0, \end{align}
(A12) \begin{align} \zeta _{1,\tau _2}^{(1)}+ \zeta _{1,\tau _1}^{(2)}+ c_1 \zeta _{1}^{(1)} \zeta _{1,\xi }^{(2)} + c_1 \zeta _1^{(2)} \zeta _{1,\xi }^{(1)}+ c_3 \big({\zeta _1^{(1)}}^3\big)_\xi & =0. \end{align}

Here $c_1$ and $c_3$ are given, in dimensional form, by (2.21) and (2.30), respectively. From $\zeta _1=\alpha \, \zeta _1^{(1)}+\alpha ^2\,\zeta _1^{(2)}+{\cdots}$ , (A12) can be written, after recovering the original variables, as

(A13) \begin{equation} \zeta _{1,t}+c_0\,\zeta _{1,x}+c_1\,\zeta _1\zeta _{1,x}+c_3 \big({\zeta _1}^3\big)_x = 0. \end{equation}

When this is combined with the linear dispersive term given by $c_2\,\zeta _{1,xxx}$ , we obtain the Gardner equation given by (2.29).

Appendix B. Some properties of the curve $P=0$

Consider fixed values for the densities $\rho _i$ and depths $H_i$ ( $i=1,2,3$ ). Clearly, any critical point of $V$ obtained for a given value of $c$ will belong to the curve $P=0$ . We prove here the ‘converse’ statement.

Lemma 1. Let $(\zeta _1,\zeta _2)$ be point of the admissible region $\mathcal{R}$ satisfying $P(\zeta _1,\zeta _2)=0$ . Then, it must be a critical point of $V$ for some real value $c$ .

Proof. Clearly, the origin is a trivial critical point of $V$ for all $c\gt 0$ . Let $(\zeta _1, \zeta _2)\in {\mathcal{R}}\setminus \{(0,0)\}$ , verifying $P=0$ . Then, a value of $c$ can be found such that (4.8) holds, by setting

(B1) \begin{equation} c^2= 2 g(\rho _3-\rho _2) h_2^2 h_3^2 \,\frac {\zeta _2}{G_3(h_2,h_3)}, \end{equation}

with $G_3$ defined by (4.11). For $c$ to be real, it remains to be proven that the right-hand side of (B1) is positive. Note that the level set $G_3=0$ can be viewed as the graph $\zeta _1 = f(\zeta _2)$ for a certain monotonic function $f$ . As a consequence, the curve $G_3=0$ splits $\mathcal{R}$ into two separate connected components. Moreover, $G_3(\zeta _1,\zeta _2)\gt 0$ ( ${\lt } 0$ ) when $(\zeta _1,\zeta _2)$ lies above (below) the level set $G_3=0$ . To prove our claim it suffices then to show that $P=0$ is above (below) $G_3=0$ for $\zeta _2\gt 0$ ( $\zeta _2\lt 0$ ).

From (4.9), it follows that if $(\zeta _1,\zeta _2)$ belongs to $P=0$ then ${\text{sgn}\,}(\zeta _1/\zeta _2) = {\text{sgn}\,} (G_1/G_3)$ . Since $G_3=0$ is included in the first and third quadrants, only these regions need to be considered. We find that $G_1$ and $G_3$ have the same sign on the region between the level sets $G_1=0$ and $G_3=0$ (see figure 24). Given that $G_1=0$ is above (below) the level set $G_3=0$ for $\zeta _2\gt 0$ ( $\zeta _2\lt 0$ ), the result follows.

Figure 24. A typical region on the $(\zeta _1,\zeta _2)$ -plane where $G_1$ and $G_3$ defined by (4.10), (4.11), respectively, have the same sign (shaded region). The level sets $G_1=0$ and $G_3=0$ are shown by a dashed (purple) line and a solid (blue) line, respectively.

We consider now one more important feature for the curve $P=0$ .

Lemma 2. The origin is a crunode of the curve $P=0$ . Moreover, the tangents to the curve at the origin are given by $\zeta _2=\gamma ^\pm \zeta _1$ , as in (2.13 ).

Proof. It is easy to check that the origin satisfies $P=0$ , $ {\partial P}/{\partial \zeta _1}=0$ , $ {\partial P}/{\partial \zeta _2}=0$ , so it is indeed a finite singular point. To show that it is a crunode, we show that, at the origin, two branches of the curve have distinct tangent lines. Locally at the origin, the curve $P=0$ is described by

(B2) \begin{align} & \rho _2(\rho _1-\rho _2) H_1 H_3 \,\zeta _1^2 \nonumber \\ & + \big [ \rho _1 (\rho _2-\rho _3) H_2 H_3 - \rho _2 (\rho _1-2\rho _2+\rho _3) H_1 H_3 - \rho _3 (\rho _1-\rho _2) H_1 H_2 \big ] \zeta _1 \zeta _2 \nonumber \\ & - \rho _2 (\rho _2-\rho _3)H_1 H_3 \,\zeta _2^2 = 0, \end{align}

which splits up into two lines

(B3) \begin{equation} \zeta _1 - v_0^{\pm } \zeta _2 = 0, \end{equation}

where $v_0^+$ and $v_0^-$ have opposite signs and are defined by

(B4) \begin{equation} v_0^{\pm } = \frac {\rho _1 \rho _2 H_3 + \rho _1 \rho _3 H_2 + \rho _2 \rho _3 H_1}{g\rho _2 (\rho _1-\rho _2)H_1 H_3} \left (c_0^{\pm }\right )^2 + \frac {\rho _2 H_3+\rho _3 H_2}{\rho _2 H_3}. \end{equation}

By expanding (2.12) as a quadratic equation for $c^2$ , we see that

(B5) \begin{equation} (c_0^-)^2 (c_0^+)^2 = \frac {g^2 H_1 H_2 H_3 (\rho _1-\rho _2)(\rho _2-\rho _3)}{\rho _1 \rho _2 H_3 + \rho _1 \rho _3 H_2 + \rho _2 \rho _3 H_1}, \end{equation}

which together with (2.14) and (B4) implies that $v_0^\pm = (\gamma ^\mp )^{-1}$ . In other words the tangent lines to the curve $P=0$ at the origin are given by $\zeta _2=\gamma ^\pm \zeta _1$ .

B.1. Boussinesq approximation

The Hamiltonian structure of the dynamical system governing the ISWs of the MMCC3 model is preserved under the Boussinesq approximation. Under this approximation, once all physical parameters are set, the critical points can be found as the solution of $\boldsymbol{\nabla }V_B=(0,0)$ for the corresponding potential $V_B$ , specifically,

(B6) \begin{equation} V_B (\zeta _1,\zeta _2)=\frac {1}{2} \left \{ -c^2 \left ( \frac {\zeta _1^2}{h_1}+\frac {\zeta _2^2}{h_3} + \frac {(\zeta _2-\zeta _1)^2}{h_2} \right ) + g_1^\prime \,\zeta _1^2 + g_2^\prime \,\zeta _2^2 \right \}, \end{equation}

or, equivalently, as the intersection of the following plane algebraic curves:

(B7) \begin{align} {\mathcal{C}}_{1_B} & \equiv \frac {1}{2} c^2 \big ( H_1^2 h_2^2 - H_2^2 h_1^2 \big ) -g_1^\prime \,\zeta _1 h_1^2 h_2^2 =0, \\[-12pt]\nonumber \end{align}
(B8) \begin{align} {\mathcal{C}}_{2_B} & \equiv \frac {1}{2} c^2 \big ( H_2^2 h_3^2 - H_3^2 h_2^2 \big ) -g_2^\prime \,\zeta _2 h_2^2 h_3^2 =0. \end{align}

Once again, the geometrical locus of all critical points can be found by eliminating the wave speed $c$ from these two equations. As a result,

(B9) \begin{equation} P_B (\zeta _1,\zeta _2) \equiv \delta \,\zeta _1 \,h_1^2 \big (H_2^2 h_3^2 - H_3^2 h_2^2\big ) - \zeta _2 \,h_3^2 \big (H_1^2 h_2^2 - H_2^2 h_1^2 \big ) =0, \end{equation}

with $\delta =\varDelta _1/\varDelta _2$ in (2.16). Similarly to what is observed for $P=0$ , there are also two distinct configurations for the curve $P_B=0$ . The transition between the two configurations can be more easily studied here. It transpires that, for $\delta =H_3/H_1$ , the curve $P_B=0$ becomes degenerate and splits up into two components, one of which is the line of equation

(B10) \begin{equation} \zeta _2=-(H_3/H_1) \,\zeta _1. \end{equation}

The coordinates $(\zeta _1,\zeta _2)$ of the extra finite singular point can be determined as follows. First, we solve

(B11) \begin{align} 2H_1^2 H_2 (H_1+H_3) & + H_1 \big ( H_1^2 - 3H_2^2 - 6H_2 H_3 + H_3^2 + 2 H_1 (-3H_2 + H_3)\big ) \zeta _1 \nonumber \\ & +\big ( H_2^2 - 3(H_1+H_3)^2 \big ) \zeta _1^2=0 \end{align}

to find $\zeta _1$ and then use (B10) to find $\zeta _2$ .

Appendix C. Power series expansion of the curves ${\mathcal{C}}_1$ and ${\mathcal{C}}_2$ and critical point collision at the origin

As we have mentioned in the text, in the vicinity of the origin, each curve ${\mathcal{C}}_k=0$ ( $k=1,2$ ) can be expressed as a convergent power series in terms of $\zeta _1$ :

(C1) \begin{equation} \zeta _2 = p_k \zeta _1 + q_k \zeta _1^2 + r_k \zeta _1^3 + \ldots , \quad k=1,2. \end{equation}

Such is the case because at no point of the curves ${\mathcal{C}}_k$ we have $ {\partial {\mathcal{C}}_k}/{\partial \zeta _2}=0$ . Indeed, simple calculations lead to

(C2) \begin{equation} \frac {\partial {\mathcal{C}}_1}{\partial \zeta _2}=-\frac {2}{h_2} \left [ \frac {1}{2}c^2 G_1 - g(\rho _2-\rho _1)\zeta _1 h_1^2 h_2^2 + \frac {1}{2} c^2 \rho _2 H_2^2 h_1^2 \right ]\!, \end{equation}

with $G_1$ defined in (4.10). If $(\zeta _1,\zeta _2) \in {\mathcal{C}}_1=0$ then it follows from (4.7) that

(C3) \begin{equation} \frac {\partial {\mathcal{C}}_1}{\partial \zeta _2} = -\frac {2}{h_2} \left ( \frac {1}{2} c^2 \rho _2 H_2^2 h_1^2 \right ) \lt 0. \end{equation}

Similar steps can be taken to show that $ {\partial {\mathcal{C}}_2}/{\partial \zeta _2}\neq 0$ at any point on the curve ${\mathcal{C}}_2$ .

Now that the conditions are fulfilled to legitimate the use of power series to expand the curves in the vicinity of the origin, it remains to find the coefficients in each expansion. To that purpose, we use Newton’s method (see, e.g. Christensen Reference Christensen1996) and find that

(C4) \begin{align} p_1 & =\frac {R_2}{\rho _2 H_1 c^2}, \quad \quad p_2 = \frac {\rho _2 H_3 c^2}{S_2}, \quad q_1=\frac {3 (\rho _2-\rho _1) H_2 R_4}{2 \rho _2^2 H_1^2 c^4}, \nonumber \\ q_2 & = \frac {3\rho _2 (\rho _2-\rho _3) H_2 H_3 c^2 S_4}{2 S_2^3}, \!\quad r_1= \frac {(\rho _1-\rho _2) H_2 R_6}{2 \rho _2^3 H_1^3 c^6 }, \!\quad r_2 = \frac {\rho _2 (\rho _2-\rho _3)H_2 H_3 c^2 S_8}{2 S_2^5}, \end{align}

where polynomials $R_j$ , $S_j$ ( $j=2,4$ ) are defined as

(C5) \begin{align} R_2 (c) &= (\rho _1H_2 + \rho _2 H_1)\, c^2-g(\rho _2-\rho _1)H_1 H_2, \nonumber\\ S_2 (c) &= (\rho _2 H_3 + \rho _3 H_2) \,c^2-g(\rho _3-\rho _2)H_2 H_3, \\[-12pt]\nonumber \end{align}
(C6) \begin{align} R_4 (c) &= \rho _1 c^2 (c^2 + 2gH_1)+g^2H_1^2 (\rho _1-\rho _2), \nonumber\\ S_4 (c) &= \rho _3 c^2 (c^2-2gH_3) + g^2 H_3^2 (\rho _3-\rho _2), \end{align}

and the polynomials $R_6(c)$ and $S_8(c)$ are given by

(C7) \begin{align} R_6(c) &= 5 \rho _1^2 (c^2+ g H_1)^3 - \rho _1 \rho _2 (c^2 + g H_1) \big ( 4 c^4 + 5 c^2 g H_1 + 10 g^2 H_1^2\big ) + 5 g^3 H_1^3 \rho _2^2, \nonumber\\[3pt] \end{align}
(C8) \begin{align} S_8(c) &= 4 g^4 H_2 H_3^4 (\rho _2-\rho _3)^3 - g^3 H_3^3 (\rho _2-\rho _3)^2 (5 \rho _2 H_3 - 16 \rho _3 H_2)c^2 \nonumber \\ & \quad -3 g^2 H_3^2 \rho _3 (\rho _2-\rho _3) (3\rho _2 H_2 + 5 \rho _2 H_3 - 8 \rho _3 H_2) c^4 \nonumber \\ & \quad + \rho _3 \big (\! -4 \rho _2^2 H_3 + 5 \rho _2 \rho _3 (H_2+H_3) - 4 \rho _3^2 H_2 \big ) c^8 \nonumber \\ & \quad + g H_3 \rho _3 \big ( \rho _2^2 (-4 H_2 + 9 H_3) - 3 \rho _2 \rho _3 (6 H_2+5 H_3)+16 \rho _3^2 H_2 \big ) c^6. \end{align}

For a double collision of critical points to occur at the origin, we must impose $p_1=p_2$ , resulting in

(C9) \begin{equation} Q_4(c) \equiv R_2(c) \, S_2(c) - \rho _2^2 \,H_1 H_3 \,c^4=0, \end{equation}

which is precisely condition (2.12) for the linear long-wave speeds. For a triple collision at the origin, we must have

(C10) \begin{equation} p_1=p_2, \quad q_1=q_2, \end{equation}

which amounts to imposing

(C11) \begin{equation} Q_{10}(c) \equiv (\rho _2-\rho _1) R_4 (c) S_2^3(c) +\rho _2^3 (\rho _3-\rho _2) H_1^2 H_3 c^6 S_4(c)=0 \quad \text{at} \quad c=c_0. \end{equation}

The polynomials $Q_4$ and $Q_{10}$ introduced in (C9) and (C11) play an important role in predicting the polarity of ISWs (see Appendix D).

To simplify the expression of $Q_{10}(c_0)$ , we first observe that from (C9) it follows that $S_2(c_0) = \rho _2 H_3 c_0^2 \,\gamma ^{-1}$ , with $\gamma$ defined by (2.14). Simple calculations lead to

(C12) \begin{align} (\rho _2-\rho _1) R_4(c_0) &=\rho _2^2 c_0^4 \Bigg [ \frac {\rho _1}{\rho _2} - \frac {H_1^2}{H_2^2} (1-\gamma )^2 \Bigg ], \end{align}
(C13) \begin{align} (\rho _3-\rho _2) S_4(c_0) &=\rho _2^2 c_0^4 \Bigg [ -\frac {\rho _3}{\rho _2} + \frac {H_3^2}{H_2^2} (1-\gamma ^{-1})^2 \Bigg ]. \end{align}

When these are put together, simple manipulations of (C11) lead to

(C14) \begin{equation} \frac {\rho _3}{H_3^2} \gamma ^3 + \frac {\rho _2}{H_2^2} (1-\gamma )^3 - \frac {\rho _1}{H_1^2} =0, \end{equation}

which is precisely the criticality condition obtained in (2.23).

Lastly, for a quadruple collision, we must impose

(C15) \begin{equation} p_1=p_2, \quad q_1=q_2, \quad r_1 = r_2. \end{equation}

At the linear long-wave speeds, $r_1=r_2$ amounts to considering

(C16) \begin{equation} (\rho _2-\rho _1) R_6(c_0) S_2^5(c_0) - \rho _2^4 H_1^3 H_3 c_0^8 (\rho _3-\rho _2) S_8(c_0)=0. \end{equation}

We use the same idea employed above to simplify the expression, obtaining

(C17) \begin{align} & -\!4 \rho _1 \rho _2 H_2^3 H_3^4 - 9 \rho _1 \rho _2 H_1 H_2^2 H_3^4 (1-\gamma ) + H_1^3 \big [ \rho _2^2 H_3^4 (1-\gamma )^4 (5+9\gamma ) \nonumber \\ & \quad - \rho _2 \rho _3 H_2^2 H_3 \gamma ^3 [ 4 H_2 \gamma + 9 H_3(1-\gamma )(1-2\gamma )] + 9 \rho _3^2 H_2^4 \gamma ^5\big ]=0. \end{align}

Since a quadruple collision at the origin is obtained by requiring that both (C14) and (C17) are satisfied simultaneously, we may cast (C17) into the form

(C18) \begin{align} & -\!4 \rho _1 \rho _2 H_2^3 H_3^4 - 9 \rho _1 \rho _2 H_1 H_2^2 H_3^4 (1-\gamma ) + H_1^3 \big [ \rho _2^2 H_3^4 (1-\gamma )(1-\gamma )^3 (5+9\gamma ) \nonumber \\ & \quad - \rho _2 \rho _3 H_2^2 H_3 \gamma ^3 [ 4 H_2 \gamma + 9 H_3(1-\gamma )(1-2\gamma )] + 9 \rho _3^2 H_2^4 \gamma ^2 \gamma ^3 \big ]=0 .\end{align}

If we now use (C14) and manipulate the expression as

(C19) \begin{align} & -\!4 \rho _1 \rho _2 H_2^3 H_3^4 - 9 \rho _1 \rho _2 H_1 H_2^2 H_3^4 (1-\gamma ) \nonumber \\ & +H_1^3 \left \{ \rho _2^2 H_3^4 (1-\gamma )\frac {H_2^2}{\rho _2} \left ( \frac {\rho _1}{H_1^2}-\frac {\rho _3}{H_3^2} \gamma ^3 \right ) (5+9\gamma ) \right. \nonumber \\ & - \rho _2 \rho _3 H_2^2 H_3 \gamma ^3 [ 4 H_2 \gamma + 9 H_3(1-\gamma )(1-2\gamma )] \nonumber \\ & + \left. 9 \rho _3^2 H_2^4 \gamma ^2 \frac {H_3^2}{\rho _3} \left ( \frac {\rho _1}{H_1^2}-\frac {\rho _2}{H_2^2} (1-\gamma )^3 \right ) \right \}=0, \end{align}

we conclude that a quadruple collision occurs when the following multivariate polynomial vanishes:

(C20) \begin{align} W & \equiv -4\rho _2\rho _3 H_1^3 H_2^2 H_3 (H_2+H_3) \gamma ^4 + 13 \rho _2 \rho _3 H_1^3 H_2^2 H_3^2 \gamma ^3 \nonumber \\ & \quad -9H_1H_2^2 H_3^2 \big ( \rho _1 \rho _2 H_3^2 - \rho _1 \rho _3 H_2^2 + \rho _2 \rho _3 H_1^2 \big ) \gamma ^2 + 13 \rho _1 \rho _2 H_1 H_2^2 H_3^4 \gamma \nonumber \\ & \quad -4\rho _1\rho _2 H_2^2 H_3^4 (H_1 + H_2)=0. \end{align}

Let us introduce the multivariate polynomials $\tilde {T}_1$ , resulting from $T_1$ in (2.31) according to

(C21) \begin{equation} T_1 = \frac {c_0}{4 \rho _2 H_1^3 H_2^3 H_3^3} \tilde {T}_1, \end{equation}

and $Z$ , resulting from the criticality condition (C14):

(C22) \begin{equation} Z = \rho _3 H_1^2 H_2^2 \gamma ^3 + \rho _2 H_1^2 H_3^2 (1-\gamma )^3 - \rho _1 H_2^2 H_3^2. \end{equation}

Then, it is easy to check that

(C23) \begin{equation} W - H_3 \tilde {T}_1 = - 5 H_1 H_3^2 (1-\gamma )Z. \end{equation}

Hence, at criticality, we have $Z=0$ and so $W=0$ if and only if $\tilde {T}_1=0$ . In other words, the quadruple collision at the origin can be linked to the simultaneous vanishing of the quadratic and cubic nonlinearity coefficients ( $c_1,c_3$ , respectively) of the Gardner equation.

Appendix D. Polarity of interfacial waves

Appendix C shows that, as long as we remain away from criticality, a double collision of critical points will occur at the origin when $c=c_0^\pm$ . One of the points involved in the collision is the origin itself. The other point can be found (approximately) as long as the wave speed $c$ is close enough to $c_0^\pm$ . Indeed, from (4.13), it follows that in a neighbourhood of the origin, $(\zeta _1,\zeta _2) \in {\mathcal{C}}_1 \cap {\mathcal{C}}_2$ provided

(D1) \begin{equation} \zeta _1 \big [ (p_1-p_2) + (q_1 - q_2)\zeta _1 + O\big(\zeta _1^2\big) \big ]=0. \end{equation}

Therefore, at leading order two points of ${\mathcal{C}}_1 \cap {\mathcal{C}}_2$ can be found, with ordinates prescribed by

(D2) \begin{equation} \zeta _1 =0, \quad (q_1-q_2)\zeta _1 = p_2-p_1. \end{equation}

Then, from (C4),

(D3) \begin{equation} p_2-p_1 = \frac {- Q_4 (c)}{\rho _2 H_1 c^2 S_2 (c)}, \quad q_1-q_2 = \frac {3H_2 Q_{10} (c)}{2\rho _2^2 H_1^2 c^4 S_2^3(c)}. \end{equation}

Here, $Q_4$ and $Q_{10}$ are defined in (C9) and (C11). Also, it can be easily checked that

(D4) \begin{align} Q_4(c) \lt 0 & \quad \text{if and only if} \quad \big(c_0^-\big)^2 \lt c^2\lt \big(c_0^+\big)^2, \nonumber \\ S_2(c)\lt 0 & \quad \text{if and only if} \quad c^2\lt g\! H_2 H_3 (\rho _3-\rho _2)/(\rho _2 H_3 + \rho _3 H_2). \end{align}

We examine now the collision at the linear long-wave speed $c_0^-$ . Just before collision, we can assume that $c^2/(c_0^-)^2 = 1-\epsilon$ ( $\epsilon \ll 1$ ), check the sign of $p_2-p_1$ and $q_1-q_2$ and use (D2) to find the way the non-trivial critical point approaches the origin. From (D3) it follows that

(D5) \begin{equation} {\text{sgn}\,} (p_2-p_1) ={\text{sgn}\,} (Q_4) =1, \quad {\text{sgn}\,}(q_1-q_2) = - {\text{sgn}\,} (Q_{10}), \end{equation}

and from (D2),

(D6) \begin{equation} {\text{sgn}\,}(\zeta _1) = {\text{sgn}\,} \left ( \frac {p_2-p_1}{q_1-q_2}\right ) = {\text{sgn}\,}(q_1-q_2) = - {\text{sgn}\,} (Q_{10}). \end{equation}

Therefore, just before collision, the critical point (a saddle of $V$ ) approaches the origin from the left (right) when $Q_{10} \gt 0 \,(\lt 0)$ . Similarly, just after collision, we can assume that $c^2/(c_0^-)^2 = 1+\epsilon$ ( $\epsilon \ll 1$ ) to conclude that

(D7) \begin{equation} {\text{sgn}\,} (p_2-p_1) ={\text{sgn}\,} (Q_4) =-1, \quad {\text{sgn}\,}(q_1-q_2) = - {\text{sgn}\,} (Q_{10}), \end{equation}

and hence,

(D8) \begin{equation} {\text{sgn}\,}(\zeta _1) = {\text{sgn}\,} \left ( \frac {p_2-p_1}{q_1-q_2}\right ) = -{\text{sgn}\,}(q_1-q_2) = {\text{sgn}\,} (Q_{10}). \end{equation}

This means that after collision, the critical point (now a minimum of $V$ ) leaves the origin towards the right (left) when $Q_{10} \gt 0 \,(\lt 0)$ . More can be said about the location of this critical point, before and after the collision at the origin. It suffices to note that (4.13) implies that ${\text{sgn}\,}(\zeta _2) = {\text{sgn}\,}(p_1 \zeta _1) = -{\text{sgn}\,}(\zeta _1)$ , and so during the collision process such a point can only belong to the second and fourth quadrants. This implies, in particular, that any critical point on branch II of $P=0$ will remain on that branch.

Let $\rho _i$ and $H_i$ , $i=1,2,3$ be physical parameters such that $Q_{10} (c_0^-)\neq 0$ . Then, there exist $\epsilon \gt 0$ ( $\ll 1$ ) such $Q_{10}\neq 0$ within the range $1-\epsilon \lt c^2/(c_0^-)^2 \lt 1+\epsilon$ . It can be deduced that mode-2 homoclinic orbits are obtained by leaving the origin in the direction of the unique minimum on branch II. Consequently, $Q_{10}(c_0^-) \gt 0$ ( ${\lt } 0$ ) corresponds to a wave of elevation (depression) for the upper interface. We also observe that for mode-2 waves, ${\text{sgn}\,}(c_1)={\text{sgn}\,}(Q_{10}(c_0^-))$ , indicating that the KdV equation correctly predicts the solution polarity.

The double collision at the origin when $c=c_0^+$ can be examined in an analogous way. As before, it can be established that any critical point on branch I of $P=0$ will remain on that branch. The key difference is that, depending on the parameters considered, two distinct branches of mode-1 solutions may exist. The branch that bifurcates from zero amplitude is the one corresponding to homoclinic orbits departing from the origin in the direction of the saddle on branch I closest to the origin, i.e. the point involved in the collision at the origin at $c=c_0^+$ . This assertion implies that if a branch of solutions bifurcates from zero amplitude, it must exhibit KdV polarity. This results from $-Q_{10} (c_0^+)\gt 0$ ( ${\lt } 0$ ) corresponding to a wave of elevation (depression) of the upper interface, together with the fact that for mode-1 waves, ${\text{sgn}\,}(c_1)=-{\text{sgn}\,}(Q_{10}(c_0^+))$ .

References

Barros, R. 2016 Remarks on a strongly nonlinear model for two-layer flows with a top free surface. Stud. Appl. Math. 136, 263287.10.1111/sapm.12104CrossRefGoogle Scholar
Barros, R., Choi, W. & Milewski, P.A. 2020 Strongly nonlinear effects on internal solitary waves in three-layer flows. J. Fluid Mech. 883, A16.10.1017/jfm.2019.795CrossRefGoogle Scholar
Benjamin, T.B. 1966 Internal waves of finite amplitude and permanent form. J. Fluid Mech. 25, 241270.10.1017/S0022112066001630CrossRefGoogle Scholar
Benjamin, T.B. 1971 A unified theory of conjugate flows. Phil. Trans. R. Soc. A 269 (1201), 587643.Google Scholar
Benney, D.J. 1966 Long non-linear waves in fluid flow. J. Math. Phys. 45, 5263.10.1002/sapm196645152CrossRefGoogle Scholar
Camassa, R., Choi, W., Michallet, H., Rusas, P. & Sveen, J.K. 2006 On the realm of validity of strongly nonlinear asymptotic approximations for internal waves. J. Fluid Mech. 549, 123.10.1017/S0022112005007226CrossRefGoogle Scholar
Choi, W. 2000 Modeling of strongly nonlinear internal waves in a multilayer system. In Proceedings of the Fourth International Conference on Hydrodynamics (ed. Y. Goda, M. Ikehata & K. Suzuki), pp. 453458.Google Scholar
Choi, W. & Camassa, R. 1999 Fully nonlinear internal waves in a two-fluid system. J. Fluid Mech. 396, 136.10.1017/S0022112099005820CrossRefGoogle Scholar
Christensen, C. 1996 Newton’s method for resolving affected equations. College Math. J. 27 (5), 330340.10.1080/07468342.1996.11973804CrossRefGoogle Scholar
Coolidge, J.L. 1959 A Treatise on Algebraic Curves. Dover Publications, Inc.Google Scholar
Cox, D., Little, J., O’shea, D. & Sweedler, M. 1997 Ideals, Varieties, and Algorithms. Springer.Google Scholar
Cox, D., Little, J. & O’Shea, D. 2005 Using Algebraic Geometry. Springer Science & Business Media.Google Scholar
Derzho, O.G. 2017 Brief communication: multiscaled solitary waves. J. Nonlinear Process. Geophys. 24, 695700.10.5194/npg-24-695-2017CrossRefGoogle Scholar
Dias, F. & Il’ichev, A. 2001 Interfacial waves with free-surface boundary conditions: an approach via a model equation. Physica D 150, 278300.10.1016/S0167-2789(01)00149-XCrossRefGoogle Scholar
Doak, A., Barros, R. & Milewski, P.A. 2022 Large mode-2 internal solitary waves in three-layer flows. J. Fluid Mech. 953, A42.10.1017/jfm.2022.974CrossRefGoogle Scholar
Dubreil-Jacotin, M.L. 1932 Sur les ondes de type permanent dans les liquides heterogenes. Atti Accad. Naz. Lincei Rend 6, 814819.Google Scholar
Dunphy, M., Subich, C. & Stastna, M. 2011 Spectral methods for internal waves: indistinguishable density profiles and double-humped solitary waves. Nonlinear Process. Geophys. 18, 351358.10.5194/npg-18-351-2011CrossRefGoogle Scholar
Emelyanov, S.V., Korovin, S.K., Bobylev, N.A. & Bulatov, A.V. 2007 Homotopy of Extremal Problems: Theory and Applications. De Gruyter.10.1515/9783110893014CrossRefGoogle Scholar
Evans, W.A.B. & Ford, M.J. 1996 An integral equation approach to internal (2-layer) solitary waves. Phys. Fluids 8, 20322047.10.1063/1.869006CrossRefGoogle Scholar
Grimshaw, R. 1981 Evolution equations for long nonlinear waves in stratified shear flow. Stud. Appl. Math. 65, 159188.10.1002/sapm1981652159CrossRefGoogle Scholar
Grimshaw, R., Pelinovsky, E. & Talipova, T. 1997 The modified Korteweg-de Vries equation in the theory of large-amplitude internal waves. Nonlinear Process. Geophys. 4, 237250.10.5194/npg-4-237-1997CrossRefGoogle Scholar
Grimshaw, R., Pelinovsky, E. & Talipova, T. 1999 Solitary wave transformation in a medium with sign-variable quadratic nonlinearity and cubic nonlinearity. Physica D 132, 4062.10.1016/S0167-2789(99)00045-7CrossRefGoogle Scholar
Guan, X., Doak, A., Milewski, P. & Vanden-Broeck, J.-M. 2024 Nonlinear travelling periodic waves for the Euler equations in three-layer flows. J. Fluid Mech. 981, A12.10.1017/jfm.2024.73CrossRefGoogle Scholar
Helfrich, K.R. & Melville, W.K. 2006 Long nonlinear internal waves. Annu. Rev. Fluid Mech. 38, 395425.10.1146/annurev.fluid.38.050304.092129CrossRefGoogle Scholar
Kurkina, O.E., Kurkin, A.A., Rouvinskaya, E.A. & Soomere, T. 2015 Propagation regimes of interfacial solitary waves in a three-layer fluid. Nonlinear Process. Geophys. 22, 117132.10.5194/npg-22-117-2015CrossRefGoogle Scholar
Lamb, K.G. 2000 Conjugate flows for a three-layer fluid. Phys. Fluids 12, 21692185.10.1063/1.1287652CrossRefGoogle Scholar
Lamb, K.G. 2023 Interaction of mode-one internal solitary waves of opposite polarity in double-pycnocline stratifications. J. Fluid Mech. 962, A17.10.1017/jfm.2023.284CrossRefGoogle Scholar
Lamb, K.G. & Wan, B. 1998 Conjugate flows and flat solitary waves for a continuously stratified fluid. Phys. Fluids 10 (8), 20612079.10.1063/1.869721CrossRefGoogle Scholar
Lamb, K.G. & Yan, L. 1996 The evolution of internal wave undular bores: comparisons of a fully nonlinear numerical model with weakly nonlinear theory. J. Phys. Oceanogr. 26, 27122734.10.1175/1520-0485(1996)026<2712:TEOIWU>2.0.CO;22.0.CO;2>CrossRefGoogle Scholar
Long, R.R. 1953 Some aspects of the flow of stratified fluids: I. A theoretical investigation. Tellus 5 (1), 4258.10.3402/tellusa.v5i1.8563CrossRefGoogle Scholar
Maltseva, J.L. 1989 Unsteady long waves in a two-layer fluid. In Dinamika Sploshn. Sredy, vol. 93, pp. 96110. (in Russian).Google Scholar
Makarenko, N.I., Maltseva, J.L. & Kazakov, A.Y. 2009 Conjugate flows and amplitude bounds for internal solitary waves. Nonlinear Process. Geophys. 16 (2), 169178.10.5194/npg-16-169-2009CrossRefGoogle Scholar
Miyata, M. 1988 Long internal waves of large amplitude. In Proceedings of the IUTAM Symposium on Nonlinear Water Waves (ed. H. Horikawa & H. Maruo), pp. 399406.10.1007/978-3-642-83331-1_44CrossRefGoogle Scholar
Moum, J.N., Farmer, D.M., Smyth, W.D., Armi, L. & Vagle, S. 2003 Structure and generation of turbulence at interfaces strained by internal solitary waves propagating shoreward over the continental shelf. J. Phys. Oceanogr. 33 (10), 20932112.10.1175/1520-0485(2003)033<2093:SAGOTA>2.0.CO;22.0.CO;2>CrossRefGoogle Scholar
Pelinovsky, E., Tatiana, T. & Tarmo, S. 2021 The structure of algebraic solitons and compactons in the generalized Korteweg–de Vries equation. Phys. D: Nonlinear Phenom. 419, 132785.10.1016/j.physd.2020.132785CrossRefGoogle Scholar
Shroyer, E.L., Moum, J.N. & Nash, J.D. 2010 Vertical heat flux and lateral mass transport in nonlinear internal waves. Geophys. Res. Lett. 37, L08601.10.1029/2010GL042715CrossRefGoogle Scholar
Stastna, M. 2022 Internal waves in the ocean. In Theory and Practice: Surveys and Tutorials in the Applied Mathematical Sciences. Springer Cham.10.1007/978-3-030-99210-1CrossRefGoogle Scholar
Stastna, M. & Lamb, K.G. 2002 Large fully nonlinear internal solitary waves: the effect of background current. Phys. Fluids 14, 2987.10.1063/1.1496510CrossRefGoogle Scholar
Talipova, T.G., Pelinovsky, E.N., Lamb, K., Grimshaw, R. & Holloway, P. 1999 Cubic nonlinearity effects in the propagation of intense internal waves. Dokl. Earth Sci. 365 (2), 241244.Google Scholar
Tung, K.-K., Chan, T.F. & Kubota, T. 1982 Large amplitude internal waves of permanent form. Stud. Appl. Maths. 66, 144.10.1002/sapm19826611CrossRefGoogle Scholar
Turkington, B., Eydeland, A. & Wang, S. 1991 A computational method for solitary internal waves in a continuously stratified fluid. Stud. Appl. Math. 85, 93.10.1002/sapm199185293CrossRefGoogle Scholar
Yang, Y.J., Fang, Y.C., Tang, T.Y. & Ramp, S.R. 2010 Convex and concave types of second baroclinic mode internal solitary waves. Nonlinear Process. Geophys. 17, 605614.10.5194/npg-17-605-2010CrossRefGoogle Scholar
Figure 0

Figure 1. A three-fluid system.

Figure 1

Figure 2. (a) Shows the locus in the parameter space where the criticality condition (2.23) is satisfied, under the Boussinesq approximation with $\varDelta _1=\varDelta _2$. The criticality curve (in blue), along which no ISWs exist according to the KdV theory, splits up into two components on the $(H_1/H, H_2/H)$-plane, one of which is the line $H_2/H=-2 H_1/H + 1$, corresponding to a fixed ratio $H_1/H_3=1$. Also marked on this line is the point with coordinates $(9/26,4/13)$ at which a transition from non-existence to the existence of solitary waves to the mKdV equation (2.25) occurs. This is made clearer in (b) where the mKdV theory is adopted by relaxing the constraint $H_1=H_3$, for which the locus (in thick black) for the coexistence of ISWs of opposite polarity is revealed.

Figure 2

Figure 3. Behaviour of the quadratic and cubic nonlinearity coefficients ($c_1$, $c_3$, respectively) of the Gardner equation (2.29) for the upper interface displacement $\zeta _1(x,t)$, when restricted to mode-1 waves. The stratification is given by (2.16) with $\varDelta _1=\varDelta _2=0.1$. It is shown how the signs for the nonlinearity coefficients may vary across the whole parameter space. The colour scheme shown in panel (b) and used in panel (a) is the same as that set by Kurkina et al. (2015), and allows us to easily distinguish the various solution behaviours. Shaded regions in (b) represent a minimum amplitude along a solution branch.

Figure 3

Figure 4. Behaviour of the quadratic and cubic nonlinearity coefficients ($c_1$, $c_3$, respectively) of the Gardner equation (2.29) for the upper interface displacement $\zeta _1(x,t)$, when restricted to mode-1 waves under the Boussinesq approximation. Different values of $\delta$ are considered, and the same colour scheme as for figure 3 is adopted.

Figure 4

Figure 5. Behaviour of the quadratic and cubic nonlinearity coefficients ($\tilde {c}_1$, $\tilde {c}_3$, respectively) of the Gardner equation (2.34) for the lower interface displacement $\zeta _2(x,t)$, when restricted to mode-1 waves under the Boussinesq approximation. Different values of $\delta$ are considered, and the colour scheme remains consistent with that of figure 3; however, the coordinate axes have been redefined.

Figure 5

Figure 6. Vanishing of the quadratic (in blue) and cubic (in purple) nonlinearity coefficients ($c_1$, $c_3$, respectively) of the Gardner equation (2.29) for $\zeta _1(x,t)$, when restricted to mode-1 waves under Boussinesq approximation. The parameter $\delta$ is set to 1, as in figure 4($b$). The grey lines from the top vertex to the base of the triangle correspond to different ratios $H_1/H_3$. Note that a fixed ratio $H_1/H_3=m$ determines the line with the equation $H_2/H = -(1+m^{-1} ) H_1/H +1$ in the parameter space. From left to right we have: $H_1/H_3=0.8, \, \approx 2.5155, \,20$. For $H_1/H_3\gt 2.5155$, according to the Gardner theory for $\zeta _1$ no ISWs of opposite polarity can coexist. This will be shown to be in disagreement with the predictions by the MMCC3 and Euler theories (e.g. see figure 9$a$).

Figure 6

Figure 7. Possible solution spaces for solitary waves with speeds $c\gt c_0^+$. Here, all solution branches are limited in amplitude. For any set of parameters, the bifurcation of mode-1 waves for the Euler and MMCC3 systems is described by one of the panels A–F. Furthermore, for $H_2$ sufficiently large, one finds mode-2 solitary waves, with a bifurcation structure described in Doak et al. (2022), represented here by panel $M2$. When applicable, shaded regions represent a minimum amplitude along a solution branch. Panels A and B show two branches of solitary waves of opposite polarity, both evolving into tabletop solitary waves. Panel B has a finite amplitude bifurcation for the depression branch, while the elevation branch bifurcates from zero amplitude. The opposite is true for panel A. Panels C and D show a single solitary wave branch of depression and elevation, respectively, developing into a tabletop solitary wave. Panels E and F are new bifurcation structures. There exist two branches, both of depression (panel E, $c_1\lt 0$) or elevation (panel F, $c_1\gt 0$). One branch (shown in black) bifurcates from zero amplitude and limits to a tabletop solitary wave. A second branch (shown in blue) exists with speeds strictly greater than those on the first branch. The branch starts at the same limiting tabletop solitary wave as the first branch, but the corresponding profiles include an additional bump on the broadened section. As one proceeds along the branch via continuation, the broadened section gradually diminishes until the solution takes the form of a conventional solitary wave. As the amplitude continues to increase, the wave broadens again and approaches a different tabletop solitary wave with a larger limiting amplitude. These new branches are described in greater detail in § 6.

Figure 7

Figure 8. Solutions to the conjugate state (3.1), under the Boussinesq approximation. In each panel we plot the branches of conjugate flow speeds that exist for a fixed ratio $H_1/H_3$ and varying $H_2/H$. The physical parameters here correspond to those in figure 6. Panels (a)(c) show results for increasing values of $H_1/H_3$, respectively $H_1/H_3=0.8, \, \approx 2.5155, \,20$. The colour scheme in panel $(d)$ allows us to identify the quadrant of the $(\zeta _1,\zeta _2)$-plane to which the corresponding conjugate state belongs to. The black dashed curves are the linear long-wave speeds.

Figure 8

Figure 9. Region (shaded) of coexistence of ISWs of opposite polarity for different values of $\delta$, according to the strongly and fully nonlinear theories, under the Boussinesq approximation. Instances where the conjugate flow speed matches the linear long-wave speed $c=c_0^+$ are represented in blue (dots for intersections and a solid line for tangency points). It can be observed that the set of tangency points coincides with the locus corresponding to the criticality condition (2.23) for the KdV equation. The purple curves correspond to the vanishing of the cubic nonlinearity coefficient for the Gardner equation for $\zeta _1$, and are shown to illustrate the stark difference between Gardner and MMCC3 predictions. According to the Gardner theory for $\zeta _1$, coexistence of ISWs exist only in the region enclosed by this curve (see, e.g. figure 4b).

Figure 9

Figure 10. Solutions to the conjugate state (3.1), under Boussinesq approximation, for $\delta =0.5$ and $H_1/H_3=0.1$, using the same colour scheme as figure 8. The figure also includes vertical dotted curves, which deliminate the various solution space behaviours given by panels A–F and $M2$ in figure 7. Panel $(b)$ shows a blow-up of the region marked by a box in panel $(a)$. In panel $(b)$ we highlight the appearance of two additional conjugate state branches (a maximum and a slower saddle) with $c\gt c_0^+$ with a black square, the self-intersection of two mode-1 conjugate states (both maxima) with $c\gt c_0^+$ with a grey square and mode-1 conjugate states passing through $c=c_0^+$ with a blue circle.

Figure 10

Figure 11. All details of our conjugate state analysis encapsulated in one figure. In panel $(a)$ we identify in the parameter region for $\delta =0.5$ all relevant instances for the conjugate curves. The main markers are defined in figures 9 and 10$(b)$. In addition, the red line denotes the set of all intersections between a red conjugate curve and the linear long-wave speed $c=c_0^+$ (see figure 10$a$ for large values of $H_2/H$). This information can be used to delimit regions where different properties for mode-1 ISWs can be found, as in panel $(b)$. The colour scheme used here is outlined in figure 7. The purple lines represent the condition $c_3=0$, included for comparison with predictions from Gardner theory for $\zeta _1$.

Figure 11

Figure 12. Plots for the curve $P=0$ in (4.9). We set $(H_1,H_2,H_3)=(3,1,6)$ and adopt the stratification (2.16) with $\varDelta _1=\delta \varDelta _2$ and $\varDelta _2=0.1$. $(a)$$\delta =1$, $(b)$$\delta \approx 1.70824$, $(c)$$\delta =2$. When $c\approx 0$, there are seven critical points of $V$: three local maxima (M), three saddles (S), one minimum (m).

Figure 12

Figure 13. Instances in the parameter space where a quadruple collision occurs at the origin. Here, the Boussinesq approximation is adopted and, for each value of $\delta =\delta _0$, the number of quadruple collisions may be read from the plot by counting how many times the horizontal line $\delta =\delta _0$ intersects the graph.

Figure 13

Figure 14. Plots of the curve $P=0$ in (4.9) and points where non-trivial collisions occur. The stratification is given by (2.16), with $\varDelta _1= \varDelta _2=0.1$, and the undisturbed thicknesses of each layer parameters are set as follows: $(a)$$(H_1,H_2,H_3)=(2,6,1)$, $(b)$$(H_1,H_2,H_3)=(1,7,2)$, $(c)$$(H_1,H_2,H_3)=(3,1,6)$, $(d)$$(H_1,H_2,H_3)=(4,1,2)$. The points in black (red) correspond to collisions leading to a decrease (increase) of the number of critical points. In each panel we overlay the zero contour of the potential $V$, obtained for $(c/c_0^+)^2=1.01$. The mode-1 ISWs for the physical parameters in panels $(a)$$(d)$ resemble, respectively, those illustrated in panels $A$$D$ of figure 7.

Figure 14

Figure 15. Collision curves on the $(H_2/H,c/\sqrt {gH})$-plane for the stratification (2.16), with $\varDelta _1= \varDelta _2=0.1$. $(a)$$H_1/H_3=0.5$, $(b)$$H_1/H_3=2$. The black dashed curves are the linear long-wave speeds, at which trivial collisions occur. Non-trivial collisions are represented by dotted (solid) lines when leading to a decrease (increase) of the number of critical points. We adopt the same colour scheme as in figure 8 to reveal the quadrant of the $(\zeta _1,\zeta _2)$-plane where the collisions occur.

Figure 15

Figure 16. In panel $(a)$, similar to figure 15, collision curves are displayed on the $(H_2/H,c/\sqrt {gH})$-plane for the parameters $\varDelta _1=0.2$, $\varDelta _2=0.1$ and $H_1/H_3=10$. The two other panels illustrate distinct situations in panel $(a)$, obtained with different values of $H_2/H$: $(b)$$H_2/H=0.28$, $(c)$$H_2/H=0.6$. In both plots we set $H_1=1$ and overlay the zero contour of the potential $V$, obtained for $(c/c_0^+)^2=1.01$.

Figure 16

Figure 17. Partition of the (Boussinesq) parameter space into regions associated with distinct solution types. The dotted curves in each panel are the instances in the parameter space at which the front curve crosses the curve $c=c_0^+$. The blue curve corresponds to criticality, at which the front curve and the curve $c=c_0^+$ are tangent. The other markers in the panels (when applicable) are the black and grey squares discussed in figure 11$(b)$. The red curve represents the instance when the front curve bearing characteristics of mode-2 crosses the curve $c=c_0^+$. In the region above this red line, not only coexistence of ISWs of opposite polarity exist, but also large amplitude solutions with mode-2 characteristics. The panels show results for different values of $\delta$. The colour scheme used here is outlined in figure 7. The purple lines represent the condition $c_3=0$, included for comparison with predictions from Gardner theory for $\zeta _1$.

Figure 17

Figure 18. Same as in figure 17, but for $\delta \gt 1$.

Figure 18

Figure 19. Mode-1 solitary wave solution branches for the MMCC3 (black solid) and Euler (red dashed) systems with $\delta =0.5$, $\varDelta _2=0.01$, $H_2=0.3$ and $H_1/H_3=0.1$. There are two solution branches, the first bifurcating from $c=c_0^+$ and gradually evolving into a tabletop solitary wave with a limiting speed $c=c_1^*\approx 1.042c_0^+$. The second branch has speeds $c\gt c_1^*$. The solutions with speeds close to $c_1^*$ have a tabletop solitary wave-like structure, with an additional hump about $x=0$. As the speeds increase, the volume of the wave decreases, before again increasing to another tabletop solitary wave with a limiting speed $c=c_2^*\approx 1.059c_0^+$. The profiles of the solutions (a)( f) marked by black crosses (MMCC3) and red circles (Euler) are illustrated in figure 20. The black dashed vertical lines show the values $c=c_1^*$ and $c=c_2^*$, to which the solution branches asymptote.

Figure 19

Figure 20. Solutions (a)–( f) from figure 19 for the MMCC3 (black lines) and Euler (red circles, every fifteenth mesh point plotted) systems. When comparing the profiles, we chose solutions such that they have the same volume $Q$. Solutions (a) and (b) are on the first solution branch, with (c)–( f) being on the second. The speeds of the MMCC3 solutions (a)–( f) are (to four decimal places) $c/c_0^+=1.0380,1.0418, 1.0418,1.0427,1.0529,1.0588$, while for Euler solutions, they are the same to that accuracy except for solution $(d)$ that has $c/c_0^+=1.0427$ and $(e)$ with $c/c_0^+=1.0532$.

Figure 20

Figure 21. Trajectories (in red) of solutions (a)–( f) from figure 19 for the MMCC3 model, when projected onto the $(\zeta _1,\zeta _2)$-plane. The dashed lines represent the coordinate axes. The blue curve is $P=0$, the black curves are the equipotential $V=0$ and the green (black) dots are saddles (maxima) of $V$.

Figure 21

Figure 22. Each panel shows the upper interface $\zeta _1(x)$ of a solution of the MMCC3 (black curves) and full Euler (red circles) system. All solutions have $\delta =0.5$ and $\varDelta _2=0.01$. The total depth is normalised to $H_1+H_2+H_3=1$, and the undisturbed heights $(H_1,H_2)$ for each panel are $(a)$$(0.35,0.6)$, $(b)$$(0.1,0.8)$, $(c)$$(0.2,0.1)$, $(d)$$(0.3,0.5)$, $(e)$$(0.0636,0.3)$, $(f)$$(0.5714,0.4)$. The parameters for $H_i$ were chosen such that each panel (af) corresponds to the panels AF from figure 7. For a given speed, shown above the panel, there exists either one solution, as in panels (cf), or two solutions, as in panels (ab).

Figure 22

Figure 23. In panel $(a)$ the black circles show solution $(c)$ from figure 19 for the MMCC3 model. The colourmap shows the density field and the curves of the streamlines of a comparable DJL solution, where the undisturbed density in the far field is given by (6.2). The parameters are set as $d=1/80$, $\varDelta _1=0.005$, $\varDelta _2=0.01$, $H_2=0.3$ and $H_3=7/11$ for which the DJL solution obtained has speed $c=0.0500$, while the MMCC3 solution has speed $c=0.0505$. Panel $(b)$ shows the undisturbed density in the far field for the DJL (blue) model, approximated by the piecewise-constant stratification for the the MMCC3 (black) model.

Figure 23

Figure 24. A typical region on the $(\zeta _1,\zeta _2)$-plane where $G_1$ and $G_3$ defined by (4.10), (4.11), respectively, have the same sign (shaded region). The level sets $G_1=0$ and $G_3=0$ are shown by a dashed (purple) line and a solid (blue) line, respectively.