Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-03-29T06:39:56.610Z Has data issue: false hasContentIssue false

Unsupervised clustering identifies thermohaline staircases in the Canada Basin of the Arctic Ocean

Published online by Cambridge University Press:  06 May 2024

Mikhail G. Schee*
Affiliation:
Department of Physics, University of Toronto, Toronto, ON, Canada
Erica Rosenblum
Affiliation:
Department of Physics, University of Toronto, Toronto, ON, Canada Centre for Earth Observation Science, University of Manitoba, Winnipeg, MB, Canada
Jonathan M. Lilly
Affiliation:
Planetary Science Institute, Tucson, AZ, USA
Nicolas Grisouard
Affiliation:
Department of Physics, University of Toronto, Toronto, ON, Canada
*
Corresponding author: Mikhail G. Schee; Email: mikhail.schee@alumni.utoronto.ca

Abstract

Thermohaline staircases are a widespread stratification feature that impacts the vertical transport of heat and nutrients and are consistently observed throughout the Canada Basin of the Arctic Ocean. Observations of staircases from the same time period and geographic region form clusters in temperature-salinity (TS) space. Here, for the first time, we use an automated clustering algorithm called the hierarchical density-based spatial clustering of applications with noise to detect and connect individual well-mixed staircase layers across profiles from ice-tethered profilers. Our application only requires an estimate of the typical layer thickness and expected salinity range of staircases. We compare this method to two previous studies that used different approaches to detect layers and reproduce several results, including the mean lateral density ratio $ {R}_L $ and that the difference in salinity between neighboring layers is a magnitude larger than the salinity variance within a layer. We find that we can accurately and automatically track individual layers in coherent staircases across time and space between different profiles. In evaluating the algorithm’s performance, we find evidence of different physical features, namely splitting or merging layers and remnant intrusions. Further, we find a dependence of $ {R}_L $ on pressure, whereas previous studies have reported constant $ {R}_L $. Our results demonstrate that clustering algorithms are an effective and parsimonious method of identifying staircases in ocean profile data.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Table 1. Details of the ice-tethered profilers (ITPs) used in this study

Figure 1

Figure 1. A map showing the locations of all profiles used from ITPs 2 and 3, showing the whole Arctic in (a) and a zoomed-in view in (b). The red box designates the Canada Basin as defined by Peralta-Ferriz and Woodgate (2015).

Figure 2

Figure 2. Results from the clustering algorithm with $ {m}_{pts}=170 $ and $ \mathrm{\ell}=25 $ dbar run on 53,042 data points in the salinity range 34.05–34.75 g/kg from all up-going ITP2 profiles. (a) The data in$ \Theta $$ {S}_P $ space with dashed lines of constant potential density anomaly ($ \mathrm{kg}\;{\mathrm{m}}^{-3} $) referenced to the surface. The red box bounds the clusters marked in panels (b) and (d). (b) Profiles 183, 185, and 187 from ITP2 in a limited pressure range to show detail. Each profile is offset in $ {S}_P $ for clarity. (c) The spatial arrangement used as input for the algorithm where the gray points are noise and each color-marker combination indicates a cluster. The same color-marker combinations are used in each panel, and the markers in panels (c) and (d) are at the cluster average for each axis. (d) A subset of the data in $ \alpha \Theta $$ \beta {S}_P $ space with the linear regression line and inverse slope ($ {R}_L $) noted for each individual cluster and with dashed lines of slope $ \alpha \Theta /\beta {S}_P=1 $.

Figure 3

Figure 3. A parameter sweep showing the number of clusters found (solid lines) and DBCV (dashed lines) in ITP2 as a function of (a) 27 different values of $ \mathrm{\ell} $ with $ {m}_{pts}=170 $ and (b) 44 different values of $ {m}_{pts} $ with $ \mathrm{\ell}=25 $ dbar.

Figure 4

Table 2. The values of parameters used to run the clustering algorithm over both datasets

Figure 5

Figure 4. The average (a) pressure, (b) $ \Theta $, and (c) $ {S}_P $ for the points within each cluster for each profile (profile cluster average, PCA) across time. The clustering algorithm was run with $ {m}_{pts}=580 $ and $ \mathrm{\ell}=25 $ dbar on 678,575 data points in the salinity range of 34.21–34.82 g/kg from all up-going ITP3 profiles.

Figure 6

Table 3. The median normalized inter-cluster ranges and differences between average values of adjacent clusters for ITP2 and ITP3 calculated after removing outliers with z-score$ >2 $ in the respective variable

Figure 7

Figure 5. The value of each cluster’s normalized inter-cluster range for salinity $ {IR}_{S_P} $ in (a) and (c) and the lateral density ratio $ {R}_L $ in (b) and (d) as a function of the cluster’s average pressure. The colors and markers for ITP2 in (a) and (b) are the same as the clustering shown in Figure 2 and for ITP3 in (c) and (d), they are the same as shown in Figure 4. Markers circled in red indicate outliers with a z-score greater than $ 2 $. (b) The solid curve is a second-degree polynomial fit (equation given by the annotation) for the non-outlier points from ITP2 and the dashed curve is the same for ITP3. (d) The solid curve is a second-degree polynomial fit (equation given by the annotation) for the non-outlier points from ITP3, and the dashed curve is the same for ITP2.

Figure 8

Figure 6. Individual $ {S}_P $ profiles from ITP2, specifically chosen to show the examples of outlier clusters in $ {IR}_{S_P} $ and $ {R}_L $ highlighted by the bands of color. (a) Profiles 67, 69, 73, 75, 81, 83, 91, 93, 97, and 99 collected between August 31 and September 5, 2004. (b) Profiles 87, 89, 95, 97, 99, 101, 103, 105, 109, and 111 collected between September 3 and 7, 2004. The colors and markers are the same as the clustering shown in Figure 2. The gray dots are noise points, and the black lines show the profiles. Each profile is offset in $ {S}_P $ for clarity.

Figure 9

Figure 7. The average $ \Theta $ and $ {S}_P $ for the 34 layers found by Lu et al. (2022) in black dots and for the 43 layers found in our study using data from ITP3 and the same colors and markers as in Figure 4. Clusters circled in red are outliers in either $ {IR}_{S_P} $ or $ {R}_L $.

Figure 10

Figure 8. Individual profiles 313, 315, 317, 319, and 321 from ITP3, collected between November 10 and 12, 2005, specifically chosen to show the example of a temperature inversion highlighted by the bands of color. The colors and markers of the individual points are the same as the clustering shown in Figures 4 and 7. The gray dots are noise points. The black lines on the left of each pair are the $ {S}_P $ profiles, while the red lines on the right are for $ \Theta $. Each profile is offset in both $ {S}_P $ and $ \Theta $ for clarity.

Supplementary material: File

Schee et al. supplementary material

Schee et al. supplementary material
Download Schee et al. supplementary material(File)
File 2.3 MB