Hostname: page-component-5db58dd55d-mhzq2 Total loading time: 0 Render date: 2026-06-02T02:00:35.078Z Has data issue: false hasContentIssue false

Detecting unique wind field features in hurricane Sandy from topological data maps

Published online by Cambridge University Press:  06 April 2026

Justin Hoffmeier*
Affiliation:
Applied Mathematics, Florida Polytechnic University, USA

Abstract

This study investigates the use of topological data maps for extracting unique tropical cyclone (TC) wind field features. These maps are presented as graphs generated through a sequence of steps that filter, cluster, and identify data structure, and are used to characterize topological properties and shape in the data. The objective and scope of the method is explored through application to wind fields from the HURDAT2 data set, and its viability for detecting anomalous behavior in TCs is considered. We refer to the resulting graphs as wind field connectivity signatures (WFCS) or collective wind field connectivity map (CWFCM), depending on the data set. Our focus is Hurricane Sandy, where the method successfully identifies a complete 360-degree rotation of the high wind speed radii. This cyclical example of phase rotation of wind speed asymmetries corresponds to a distinct structural property of the graph. These methods have not been previously applied to wind field data and have only seen limited use in atmospheric sciences.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Sample data point from the cleaned HURDAT2 data set

Figure 1

Figure 1. Graphical example of the Mapper pipeline from Escolar et al. (2023). This low-dimensional geometric toy example results in a map that visualizes some of the key shapes and structures in the original point cloud. Data that falls in the overlap of two bins are marked in red in Step 3; dashed lines indicate these repeated points between bins.

Figure 2

Figure 2. The maps displayed in this figure were each constructed with a different projection space. Other choices in the Mapper pipeline were not changed; we used 10 overlapping intervals with $ 75\% $ overlap for each dimension, and agglomerative clustering within bins. Nodes are colored by average maximum sustained wind speed (MSW) relative to Philippe. The red clusters in the maps using the radii of the eastern middle wind speed, northeast low wind speed, and western low wind speed projection spaces correspond to Philippe’s time at hurricane status. Philippe’s WFCS, using the final parameter settings, is given in Figure 5.

Figure 3

Figure 3. A sample of maps from Hurricane Sandy was computed by varying the covering parameters, number of intervals, and percent overlap, providing maps of varying resolution. Each pair of parameter values defines a new cover of the low wind speed eastern quadrants radius projection space. Maps are then created using agglomerative clustering. Each map in the figure has the corresponding pair of covering parameter values displayed above it. Nodes are colored by maximum sustained wind speed (MSW), standardized relative to Sandy. In Results, we reference the hole in Sandy’s WFCS, which we see emerging in the figure, being more pronounced along $ \left(\mathrm{7,0.65}\right) $, $ \left(\mathrm{9,0.70}\right) $, and $ \left(\mathrm{9,0.75}\right) $. For our results, we defined WFCSs at $ \left(\mathrm{10,0.75}\right) $.

Figure 4

Figure 4. We used agglomerative clustering in step 3 of the WFCS construction, which has the parameter: number of clusters per bin. We conducted a sensitivity analysis of this parameter for several TCs. The maps for Hurricane Nadine, as we varied the number of clusters per bin from 3 to 6, are shown. Nadine’s maps become more fragmented, which is also typical of other TCs. However, storm features are detected across the settings. For example, when the number of clusters per bin is set to 3, Nadine’s strongest wind fields are detected through its location at the end of the red flare. When the setting is set to 6, the same wind fields have emerged as the disconnected component with 3 deep red nodes.

Figure 5

Table 2. Metrics measuring how well WFCSs retain the temporal ordering of wind fields for a sample of Hurricanes: A WFCS node represents a cluster of wind fields that may have occurred out of temporal order, and edges may further connect these out-of-sequence wind fields

Figure 6

Figure 5. WFCSs for a sample of TCs. From left to right: Hurricanes Igor, Nadine, Joaquin, and Philippe. Nodes are colored by maximum sustained wind speed (MSW), standardized relative to each respective storm. Nodes corresponding to hurricane status are highlighted in gray. In some cases, hurricane status corresponds to graphical properties; for example, the flare in Hurricane Nadine and the disconnected component in Hurricane Philippe.

Figure 7

Figure 6. Hurricane Sandy’s WFCS. In the main component, note the appearance of a hole in the upper left portion, an unusual feature among other WFCSs in our catalog. To help describe this feature and classify Sandy’s wind fields, we defined node clusters (i.e., clusters of clusters of wind fields) based on their position and connectivity. In this figure, node clusters 1, 2, and 3 are circled by pink, lavender, and gray rings, respectively. Nodes are colored by average maximum sustained wind speed (MSW) relative to Sandy.

Figure 8

Table 3. WFCS generated classification timeline of Hurricane Sandy

Figure 9

Figure 7. Six polar charts illustrating the temporal progression of the high wind speed distribution in Hurricane Sandy from October 27th to October 30, corresponding to node cluster 3 in Sandy’s WFCS (see Figure 6). Each chart represents the maximal observed high wind speed radius (in miles, indicated by the arcs and radial labels) for each of the four directional quadrants (NE, SE, SW, NW) during the specified time frame. Charts 1, 2, and 4 each represent multiple snapshots where the high wind speed was similar throughout the respective time durations. Quadrants without an arc have high wind speed radii of zero during the corresponding time frame, and help define the asymmetry in the wind field distribution at that time. A 360-degree cycle of asymmetry in the wind field distribution can be seen, moving predominantly from the Northwest sector (1) through the Southwest and Southeast, concluding in the Northeast sector (6).

Figure 10

Figure 8. The figure displays the CWFCM, which differs from WFCSs since the data is from all TCs, 2007–2015. Most nodes in this figure represent clusters comprised of wind fields from multiple TCs. Nodes are colored by average maximum sustained wind speed (MSW) relative to all TCs, 2007–2015. The CWFCM offers another tool for suggesting unusual structure. It reinforces our claim that the wind fields from node cluster 3 in Sandy’s WFCS are anomalous since they also reside in nodes on a well-defined flare in the CWFCM; the CWFCM nodes where these wind fields reside are circled by the gray ring in the figure. To determine if a wind field resides on a flare, we computed the number of edges between each node and a node identified as the central anchor.

Figure 11

Table 4. Four centrality metrics were computed for each node in the CWFCM

Author comment: Detecting unique wind field features in hurricane Sandy from topological data maps — R0/PR1

Comments

Dear Editors,

I am pleased to submit this paper, Detecting Unique Wind Field Features in Hurricane Sandy From Topological Data Maps, to Environmental Data Science. This paper is submitted for consideration as part of the journal’s forthcoming Special Issue Connecting Data-Driven and Physical Approaches: Application to Climate Modeling and Earth System Observation. This paper engages with the scope and aims of the Special Issue by presenting an innovative, data-driven method and application for the identification of wind field features in tropical cyclones. I can confirm that this paper is original and has not been submitted elsewhere. I declare no conflict of interest.

Sincerely,

Justin Hoffmeier

Review: Detecting unique wind field features in hurricane Sandy from topological data maps — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

The paper is concise, well-written, and clearly illustrates the use of a powerful tool from TDA to a novel application in meteorological science. Overall, Mapper and the WFCS map are well-explained and illustrated with a nice use of figures (although I have a few suggestions on tweaks for a couple of figures). The case study on Hurricane Sandy is somewhat secondary to the development of the WFCS signature, which is appropriate given that it is a methods paper. My main comments are about wanting more clarification and explicit descriptions in a few places, detailed below.

Major Comments:

1. The main tool used throughout this paper is the graph output of Mapper, including graphically analyzing the embeddings of those graphs. Given the importance of this tool in the paper, I would really like to see a bit of explanation on how these embeddings are chosen. This is particularly relevant given the discussion of the “hole” in the graph for Hurricane Sandy, as is (at least as described) a feature of the embedding rather than the abstract graph.

2. Section 4: In the analysis of Hurricane Sandy, much of the discussion centers around three “node clusters” – these are defined as being regions connected by “relatively few edges”. How were these clusters identified, and is there a more rigorous definition? If they were subjectively identified from how a particular embedding and upon analysis turned out to be meaningful that is a reasonable approach but should be spelled out.

3. Section 4.1.2: In this section, there is much discussion of the “phase of the asymmetry” rotating – please explicitly define what is rotating here. By “the expanse of the high wind speed”, I believe that you mean the quadrant in which the high wind radius is largest, but it is not at all clear. Similarly, in Figure 5, I understand that these are the high wind radii, but why are there now 8 potential values per time rather than the 4 quadrants used elsewhere, and why are the circles not complete?

4. Section 3.2.6: This whole subsubsection is quite muddled and informal. I am all for including more informal discussions to give intuition, but this section feels like it missed the mark. This feels like a section that could be improved by a few pieces of concrete notation and a figure.

Minor Comments:

1. Page 2, Line 12: The reference formatting on “Saggar, Shine et al 2022” seems atypical.

2. Page 2, Line 15: “Diurnal” is misspelled as “Dinural”.

3. Page 2, Line 39: The citation starting with Hallam et al 2019 should not be parenthetical here.

4. Table 1: In each line, you have one radius, so it should be specified as such rather than the plural “radii”.

5. Page 3, Line 22: In the citations for Mapper, the first author’s last name is “van Veen” – “van” is part of the last name, not a middle name as the citation software appears to have assumed.

6. Figure 1: I know that this figure is taken from Escolar et al., but if there is a way to add (either to the figure or in the discussion) some explicit indication of the points that are identified in Step 3, it would make it more clear that some of these points are repeated between bins. Ideally, this would be some sort of matched coloring or dashed line between identified points, but I understand if that isn’t practical given that this is an adapted figure.

7. Page 4, Line 10: The citation for Mapper here inside the parenthetical discussion should still be a parenthetical citation.

8. Section 3.2.2: Is the same set of covers used for all storms analyzed, or is a new set of 100 covers generated for each storm depending on the range of radii for that storm?

9. Figure 2: I have two comments about this figure. First, why is Eastern Low Wind Speed (the choice used throughout the rest of the paper) not included in this diagram for comparison? Second, the ordering of the plots here seems unintuitive, particularly the mixing and matching of ordering by quadrant(s) and wind speeds.

10. Page 10, Line 2: The Brandon et al. citation should be parenthetical.

11. Page 10, Line 3: It’s -> Its.

12. Page 13, Line 41: The portion of the sentence “measured how far … by computing a metric and then compared” is awkward.

13. Page 14, Line 20: The end of the paragraph “had a unique attribute and possible extreme TC behavior” is also awkward.

Review: Detecting unique wind field features in hurricane Sandy from topological data maps — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

The author proposed a Topological Data Maps (TDM) approach, specifically utilizing the Mapper algorithm, to generate graphs that represent the structure of tropical cyclones. The connectivity of these graphs was further analyzed, revealing interesting patterns that reflect the behavior of hurricanes. The novelty of this work lies in the application of Topological Data Analysis (TDA) to hurricane data, which has not been explored by others. This is the main novelty highlighted in the manuscript. The following comments can be considered to improve the work.

1. Highlighting the novelty merely as the application of TDA to a dataset or problem that has not been explored before is not sufficient. The author should emphasize what specific novelty or new insights are obtained in the context of the environmental problem, and how this adds value beyond what traditional approaches can achieve.

2. There is no comparative analysis between the traditional approach and the proposed approach, which raises questions about the effectiveness of the applied methodology. The author should discuss the effectiveness of the proposed approach in the manuscript.

3. The authors do not include temporal information in their analysis, although this is an important variable for identifying the months or seasonal durations when the public needs to take precautionary measures. Temporal information can be directly embedded as node member names in the graph, allowing this to be visualized directly without needing to provide the percentage sequence analysis described in Table 2. Please clarify how, without such temporal information in the graph, the findings can be translated into actionable precautionary steps for the public.

4. The authors color the graph based on Maximum Sustained Wind Speed. Did the authors consider other variables for coloring the graph, such as year, month, day, or seasonal categories? It would be more valuable to include additional visualizations based on different parameters to allow for deeper insights.

5. Table 4 presents metrics for evaluating the structure of the graph. However, the manuscript does not explain what these values mean or how they relate to the environmental problem. Please elaborate on the interpretation of these metrics and their relevance to the study context.

6. Based on the results obtained, it would be valuable to discuss whether the proposed approach can be utilized for forecasting purposes. If such an application is feasible, it should be highlighted as a potential direction in the future work section, as this would significantly enhance the practical value of the study.

Minor comments:

• et al should be written as et al. and formatted consistently throughout the manuscript.

• Please include the name of the hurricane (or tropical cyclone) that is the focus of the study in the abstract, so readers can immediately identify the specific case being investigated.

Recommendation: Detecting unique wind field features in hurricane Sandy from topological data maps — R0/PR4

Comments

No accompanying comment.

Decision: Detecting unique wind field features in hurricane Sandy from topological data maps — R0/PR5

Comments

No accompanying comment.

Author comment: Detecting unique wind field features in hurricane Sandy from topological data maps — R1/PR6

Comments

Dear Editors,

We are pleased to resubmit our manuscript, “Detecting Unique Wind Field Features in Hurricane Sandy From Topological Data Maps,” for consideration in the Environmental Data Science Special Issue: Connecting Data-Driven and Physical Approaches.

In response to the reviewers’ constructive feedback, we have implemented improvements to strengthen the manuscript’s rigor and clarity. New text has been marked in red font for the reviewer’s convenience - it can be easily removed if the manuscripts proceeds toward publication. Some of the primary updates include:

Methodological Rigor: Added explicit graph-theoretical definitions and validation for the Mapper embeddings and clustering results.

Comparative Analysis: Included a new subsection (3.2.9) comparing our TDA-based approach with traditional magnitude-based and Fourier-based metrics.

Enhanced Visualization: Completely revised Figure 7 to provide a clearer temporal representation of Hurricane Sandy’s structural rotation.

Impact: Refined the introduction and impact statement to better articulate the methodological shift toward objective structural anomaly detection.

A detailed, point-by-point response to all reviewer comments has been provided in the associated response field. We believe these revisions successfully address the reviewers' concerns while highlighting the unique value of our methodology. Thank you for your continued consideration of our work.

Sincerely,

Justin Hoffmeier

jhoffmeier@floridapoly.edu

Review: Detecting unique wind field features in hurricane Sandy from topological data maps — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

The author(s) have addressed all of my concerns. I recommend acceptance.

Review: Detecting unique wind field features in hurricane Sandy from topological data maps — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

I am happy to say that the authors satisfactorily addressed all of my comments in the revised version of the manuscript. In response to their question about Section 3.2.6, I think that the new version is much clearer, and does indeed add value to the paper and should be retained.

I also appreciate the authors replacing Figure 7 (the authors are correct - I had incorrectly referred to this as Figure 5 in my initial review) with a much clearer figure. With the clearer figure, another (more minor) point of confusion was exposed for me: why do the durations and spacings of the 6 time periods examined vary so widely? One time period is over 24 hours, while several are single time snapshots. Do these correspond to nodes in cluster 3 of the WFCS? The figure is certainly interpretable and clear as is (the caption states that the RMW are the maximum radii over the given time periods, for periods that span more than one observation) but the uneveness of the chosen time periods stood out to me, and if there is a simple explanation that could be added to the caption, I think that would be a useful addition.

Recommendation: Detecting unique wind field features in hurricane Sandy from topological data maps — R1/PR9

Comments

No accompanying comment.

Decision: Detecting unique wind field features in hurricane Sandy from topological data maps — R1/PR10

Comments

No accompanying comment.