Hostname: page-component-77f85d65b8-8wtlm Total loading time: 0 Render date: 2026-03-29T15:03:45.711Z Has data issue: false hasContentIssue false

Neural network approaches for sea surface height predictability using sea surface temperature

Published online by Cambridge University Press:  02 January 2025

Luther Ollier*
Affiliation:
LOCEAN, IPSL, Paris, France
Sylvie Thiria
Affiliation:
LOCEAN, IPSL, Paris, France
Carlos E. Mejia
Affiliation:
LOCEAN, IPSL, Paris, France
Michel Crépon
Affiliation:
LOCEAN, IPSL, Paris, France
Anastase Charantonis
Affiliation:
Anastase Charantonis, INRIA, IPSL, Paris, France
*
Corresponding author: Luther Ollier; Email: luther.ollier@locean.ipsl.fr

Abstract

Sea Surface Height Anomaly (SLA) is a signature of the mesoscale dynamics of the upper ocean. Sea surface temperature (SST) is driven by these dynamics and can be used to improve the spatial interpolation of SLA fields. In this study, we focused on the temporal evolution of SLA fields. We explored the capacity of deep learning (DL) methods to predict short-term SLA fields using SST fields. We used simulated daily SLA and SST data from the Mercator Global Analysis and Forecasting System, with a resolution of (1/12)° in the North Atlantic Ocean (26.5–44.42°N, −64.25–41.83°E), covering the period from 1993 to 2019. Using a slightly modified image-to-image convolutional DL architecture, we demonstrated that SST is a relevant variable for controlling the SLA prediction. With a learning process inspired by the teaching-forcing method, we managed to improve the SLA forecast at 5 days by using the SST fields as additional information. We obtained predictions of 12 cm (20 cm) error of SLA evolution for scales smaller than mesoscales and at time scales of 5 days (20 days) respectively. Moreover, the information provided by the SST allows us to limit the SLA error to 16 cm at 20 days when learning the trajectory.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. The map shows the sea surface temperature (SST) of the North Atlantic region under study, with the black box indicating the dynamic area of focus.

Figure 1

Figure 2. (a) SLA residuals (t = 24/10/1993) and its evolution 5 (b) and 10 (c) days after. SLA evolution at 5 (e) and 10 (f) days compared to the previous state: $ SLA\left(t+k\right)- SLA(t),k=5(10) $. (d) Standard deviation of the SLA over the Gulf Stream region estimated using the 20 years.

Figure 2

Figure 3. SLA-Res-U-net Architecture: A typical U-net architecture comprises two main components. Initially, the encoder reduces spatial resolution to capture patterns and incorporates feature channels for context propagation. Then, the decoder expands the resolution features from the encoder, and its output is combined with the input image to act as a residual unit. Batch normalization is added between the 2D convolutions during descent and after SiLU activation during ascent. Hyper-parameters like activation, depth, and learning rate are determined via Bayesian optimization (S0).

Figure 3

Figure 4. Training scheme: During the training phase, the network is fed with both SLA prediction and SST observations which extends the horizon of SLA. By performing backpropagation over several time steps, the model is forced to learn the dynamics of SLA.

Figure 4

Table 1. Input/output configurations and naming conventions for the various setups. Noted that for each setup, there are four input fields spaced five days apart, used to generate the subsequent five-day field

Figure 5

Table 2. RMSE according to predictions in time estimated from the Test set

Figure 6

Figure 5. RMSE for different time steps and models estimated on the whole test set. The red box is the persistence RMSE, the blue box corresponds to the SLA-SST/SLA-SST model, the green one to the DY-SLA-SST/SLA-SST, and the purple one to the DY-SLA-SST/SLA.

Figure 7

Figure 6. SLA image predictions at time t + 5 day; t + 10 days; t + 15 days; t + 20 days (t = 24/10/1993). (a) correspond to the ground truth results, (b) to the SLA-SST/SLA-SST method, (c) correspond to the DY SLA-SST/SLA-SST method, and (d) to the DY-SLA-SST/SLA method.

Supplementary material: File

Ollier et al. supplementary material

Ollier et al. supplementary material
Download Ollier et al. supplementary material(File)
File 409.1 KB

Author comment: Neural network approaches for sea surface height predictability using sea surface temperature — R0/PR1

Comments

No accompanying comment.

Review: Neural network approaches for sea surface height predictability using sea surface temperature — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

>>Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

The paper addresses the problem of low temporal availability of measurements of sea level anomaly (SLA) compared with those of sea surface temperature (SST). It tries to address this with a deep learning approach, where predictions are made for future time steps of SLA and SST based on previous snapshots. They test a combination of input/output configurations, and compare results in terms of root mean squared error. The main conclusion is that including SST as an input improves predictions of SLA.

>>Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

The paper is relevant as it uses a deep learning approach applied to climatic variables such as measurements of the ocean surface, with future work suggesting connections to ocean heat content.

>>Detailed Comments:

The paper presents an interesting approach, but there are several issues that need to be addressed for this work to be published.

Major comments:

- In the abstract, submesoscale dynamics are mentioned as the main motivations of this approach, yet the model used is at 1/12 resolution, which is **not** a submesoscale resolving model. Thus, claims that the NN predictions do well or poorly below the mesoscale do not really make sense.

- Furthermore, in the mesoscale, we would expect SST and SLA to be VERY correlated. It is in the submesoscale where we expect geostrophic balance to break down. A good benchmark for this would be to predict SLA based on SST using geostrophy alone and compare that to DY-SST-SLA/SLA shown in the paper.

- Please clarify if you are training on the northern domain and making prediction on the southern domain, or vice versa? Have you tested generalization between the two domains?

- If you show decorrelation timescales of SLA and SST it may help explain why persistence is doing so well.

- It’s hard to evaluate what 13cm means in the RMSE. Compared to figure 2, this seems like a really large error, on the order of magnitude of the signal itself. Since this is the key result of the paper it needs to be clear why this is considered a good one.

- The discussion in page 8 lines 28-33 is very bold: comparing the the deep learning method with PDE integrals and finite difference methods. Given that the data used to train and test the method in this paper is given from simulated data, any errors in the finite difference based numerical simulation will also be present in the predictions of the NN. If anything, we see that the NN has errors as well, so to make this type of claim you would need to run an independent numerical simulation with the SLA/SST given as initial conditions and make a proper comparison. I would suggest to remove this part, or support it with further analysis or appropriate references.

Minor comments:

- It is not very intuitive to use SLA for “sea surface height anomaly”. Why not use SSHA? Or SLA for “sea level anomaly”?

- Please explain what is meant by “more robust dynamical evolution” (page 2 line 48)?

- It would be helpful to draw the two domains used in the study on the box in figure 1.

- Fix equation formatting in page 4 lines 8+9.

- Figure 4: please clarify what are the input and outputs. Perhaps show the different experiments for input/outputs.

- It needs to be made clearer what are the initial conditions used for SLA and SST and how you might get those from satellite data. It is explained only in the conclusion but I think that point should be made clearer early on.

- It would be good to mention the SWOT satellite mission and how your results might be applied to new high resolution data.

- The term “SLA dynamics” are used extensively throughout the paper but it is not clear what is meant by that.

- In general, I found the various notation for the different models and experiments very confusing. Consider using a table to help clarify what are the different inputs and outputs for each, including the DY’s.

- The paper would benefit from more references throughout. In particular, in the context of ocean dynamics and the relationship between SLA and SST.

Recommendation: Neural network approaches for sea surface height predictability using sea surface temperature — R0/PR3

Comments

This article was accepted into Climate Informatics 2024 Conference after the authors addressed the comments in the review input provided. It has been accepted in Environmental Data Science on the strength of the Climate Informatics review process.

Decision: Neural network approaches for sea surface height predictability using sea surface temperature — R0/PR4

Comments

No accompanying comment.