Hostname: page-component-77f85d65b8-v2srd Total loading time: 0 Render date: 2026-03-27T15:28:04.540Z Has data issue: false hasContentIssue false

Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms

Published online by Cambridge University Press:  04 November 2024

Sam J. Silva*
Affiliation:
Department of Earth Sciences, The University of Southern California, Los Angeles, CA, USA
Mahantesh M. Halappanavar
Affiliation:
Physical and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
*
Corresponding author: Sam J. Silva; Email: samsilva@usc.edu

Abstract

Atmospheric chemical reactions play an important role in air quality and climate change. While the structure and dynamics of individual chemical reactions are fairly well understood, the emergent properties of the entire atmospheric chemical system, which can involve many different species that participate in many different reactions, are not well described. In this work, we leverage graph-theoretic techniques to characterize patterns of interaction (“motifs”) in three different representations of gas-phase atmospheric chemistry, termed “chemical mechanisms.” These widely used mechanisms, the master chemical mechanism, the GEOS-Chem mechanism, and the Super-Fast mechanism, vary dramatically in scale and application, but they all generally aim to simulate the abundance and variability of chemical species in the atmosphere. This motif analysis quantifies the fundamental patterns of interaction within the mechanisms, which are directly related to their construction. For example, the gas-phase chemistry in the very small Super-Fast mechanism is entirely composed of bimolecular reactions, and its motif distribution matches that of an individual bimolecular reaction well. The larger and more complex mechanisms show emergent motif distributions that differ strongly from any specific reaction type, consistent with their complexity. The proposed motif analysis demonstrates that while these mechanisms all have a similar design goal, their higher-order structure of interactions differs strongly and thus provides a novel set of tools for exploring differences across chemical mechanisms.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Sample graph of a single bimolecular reaction.

Figure 1

Figure 2. All possible 3-node motif isomorphism classes are studied in this work, along with species- and reaction-centered chemical explanations.

Figure 2

Figure 3. The three 3-node motifs present in the bimolecular reaction are shown in Figure 1. Motifs are shown as red arrows, and their motif isomorphism classes are labeled (see Figure 2).

Figure 3

Figure 4. Distribution of motifs for all six isomorphism classes across all three chemical mechanisms studied in this work.

Figure 4

Figure 5. The fraction of isomorphism classes centered on the HOx and NOx chemical families.

Figure 5

Figure 6. The scaled z-score of the isomorphism class prevalence in each of the three mechanisms is compared to a random baseline. Transparent bars are not statistically significant.

Figure 6

Figure 7. The mean pseudo-oxidant concentration with time across 1000 random Super-Fast baseline graphs (black line). The mean concentration for the 95%ile and 5%ile for each isomorphism class are shown in red and blue lines, respectively. The standard error of the mean estimate is shown in the shaded areas.

Supplementary material: File

Silva and Halappanavar supplementary material

Silva and Halappanavar supplementary material
Download Silva and Halappanavar supplementary material(File)
File 111.1 KB

Author comment: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R0/PR1

Comments

Dear Editor,

I am writing to submit our paper titled “Graph Characterization of Higher Order Structure in Atmospheric Chemical Reaction Mechanisms ” for consideration in Environmental Data Science. The manuscript focuses on using modern graph theoretical methods to understand emergent structures within atmospheric chemical mechanisms.

In this work, we address the need to characterize the structure of interactions in chemical mechanisms and assess the differences between the structural characteristics of different mechanisms. We quantify patterns of interaction in the mechanism through counting so-called “motifs”. These are statistically significant small patterns of connectivity (i.e., subgraphs), that have been shown in prior work to provide useful information on the structure and function of complex networks. Our findings reveal distinct motif distributions across different types of atmospheric chemical mechanisms, providing a novel framework for intercomparison and structural assessment.

We believe that our research contributes significantly to the field of atmospheric chemistry and data science by offering novel insights into the emergent structural of chemical mechanisms. This sort of analysis has never been done before in this application space, and the results provide important information for contextualizing mechanism differences.

Thank you for considering our work for publication.

Sam Silva, Ph.D.

Review: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

This paper seeks to use graph-theoretic methods to explore the structure and dynamics of three chemical mechanisms of varying complexity (the MCM, GEOS-Chem and Super-Fast). This work focusses on the emergent properties of these mechanisms; in particular, the structures studied here are called “motifs”. These are repeating structures/subgraphs which typically reoccur in the larger graph that describes the overall mechanism.

The contribution of this paper is to introduce motifs in the context of graphs of chemical mechanisms, and to count the occurrence of motifs that contain three nodes for each of the three mechanisms studied. The differing prevalence of motifs across the three chemical mechanisms are then compared to each other, and to a random baseline graph.

In general, tying the occurrence of these motifs to physical interpretations or performance metrics would be welcome before this paper is published. I also have some questions about specific technical details in the paper, where clarification may help the reader.

Please find major and minor comments below, and then a couple of typos.

Major comments

Overall, I cannot quite grasp whether there is a physically meaningful or useful link between the motifs and the dynamics/interpretation of the chemical mechanisms. From the presented work, it seems true that we can say that these mechanisms are different according to this measure of their graph structure – that they each have a different ratio of isomorphism classes, and that the prevalence of some of the classes, for some of the mechanisms, seems different to a random graph. However, I can’t grasp whether this tells us anything useful about the chemistry or the dynamics of the mechanism? Perhaps with a comparison to predictions of the mechanisms, one might be able to say: e.g. ‘mechanisms with proportionally more of isomorphism X are significantly more likely to exhibit behaviour YYY?’ Furthermore, if this graph structure was rooted in something physical, this would also be very interesting. I think the paper would be improved by adding an analysis of the dynamic behaviour of the mechanisms, and investigating whether there is some link to the isomorphisms.

Page 7, lines 30-33: ‘The agreement in prevalence between chemical mechanisms disappears beyond the first isomorphism class. In some cases, the z-score may be in a similar direction (e.g., class 2, 3, and 5) but not all mechanisms have significant class prevalence or have wildly different magnitudes.’

If you ‘normalize them to the z-score associated with the first isomorphism class’, then can you say that the ‘agreement in prevalence…disappears beyond the first isomorphism class’? It appears in Figure 6 that all isomorphism class 1s have scaled Z-Score = 1…so the agreement in prevalence is simply that they will agree exactly due to the normalization? I assume, since all three mechanisms have scaled Z-score = 1, then you have divided each mechanism Z-score by the value of isomorphism 1 Z-score for that mechanism? If we are interested in the ‘wildly’ different magnitudes of the Z-scores, then won’t that just depend on which isomorphism class we choose to scale with? This seems like it would have a significant effect in light of the differing prevalence of isomorphism 1 across mechanisms in Figure 4. Some more clarity would be welcome here.

Page 8, line 8-11: ‘work exploring the direct connection between motif prevalence and dynamical system behavior would be valuable. This would provide additional context for interpreting the implications of the differences shown in this comparative analysis.’

I think this would be valuable, and perhaps necessary. It would be good at least to outline what such a connection might be – without this, I think it is hard for a non-expert reader to understand what benefit might be gained from this analysis of graph structure. It is currently not clear to me. Perhaps the behaviour of each mechanism could be probed and linked to the number of isomorphism X in the mechanism? Perhaps it would be tricky with only three mechanisms to draw any confident conclusions?

Minor comments

Page 6, line 23-24: ‘This indicates that in the MCM, and to a lesser extent GEOS-Chem, nearly all reactions wherein a reactant is also a product are reactions that involve at least one of: OH, HO2, NO, or NO2.’

Could a similar conclusion be reached by simply counting the number of reactions in the mechanism that contain one of these species, where the reactant is also a product? If so, what is the additional insight provided by this method?

Page 7, lines 39-43: ‘However, in the GEOS-Chem mechanism, isomorphism class 2

is not statistically significantly represented in the connectivity pattern, and isomorphism class 4 is statistically significantly less likely to occur than random. The emergent structure of chemical interactions in these mechanisms differ substantially, despite the fact that all three mechanisms are dominantly composed of biomolecular reactions.’

Could you provide the number of bimolecular reactions in each mechanism? Do we simply see e.g. more/less bimolecular reactions in the MCM/GEOS-Chem vs another mechanism, and does this cleanly correlate with isomorphism class prevalence?

Page 3, lines 52-53: why is the number of nodes different to the sum of the number of unique species and the number of unique reactions for GEOS-Chem and the MCM? For the SuperFast mechanism we see 38 nodes = 18 unique species + 20 unique reactions, which seems expected. I can’t quite understand why the MCM has fewer nodes than even the number of unique reactions? Perhaps this could be clarified in the main text?

Figure 1.: While this is a useful figure, it would be very useful to also include an extended version of this figure, where a reaction such as e.g. D + E -> F + A is also included, and how would this look in the graph setup? Currently with only the single Figure it is somewhat difficult to visualise the overall graph, and this would be useful for the reader. Perhaps the full graph (or a subset) for the SuperFast mechanism could be included in an appendix.

Figure 2: For the bidirectional arrow – I assume e.g. for class 5, this is for reactions of the form e.g. A + B -> A + C + D? It might be good to include a quick figure like Figure 2, but illustrating class 5 or 6 in an appendix.

Typos

Page 7, lines 42-43: ‘despite the fact that all three mechanisms are dominantly composed of biomolecular reactions.’

Page 6, lines 49-50: ‘We further assess the motif prevalence in the graphs by comparison to the motif prevalence of structurally similar but randomly generated baseline graphs.’

Review: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

Overview: I thoroughly enjoyed this article and I thought it was a great start in using graph theoretical tools for the characterization of chemical mechanisms. The work is presented clearly, and the results are meaningful and easy to interpret. I would recommend this paper for publication after addressing the following comments.

Specific Comments:

P2 Lines 27-28: The method from Wiser et al was based on graph theory but was a new method, not one used in combustion chemistry.

P4 Figure 2: It would be helpful to show the nodes rather than just the edges, as someone unfamiliar with the field might not realize where the nodes are.

Page 6 Line 50: What algorithm or method was used to generate these random graphs (is it igraph)? Randomized graphs might become a standard for mechanism comparisons, and it is good to have replicable randomization techniques. Is it possible that different randomization techniques could lead to different motif distributions? Is there a reason your random graphs would be considered neutral?

Page 7 Lines 30-42: When discussing these z scores and the relative prevalence of motifs, it would be helpful to connect back to the actual mechanistic description. For example, GeosCHEM has fewer of motif 4 than random, which, if I am interpreting correctly, means it has fewer reactions with multiple products or repeat reactions for a reactant. It would be helpful to explain this and why that might be.

Page 7 Summary Section: It would be helpful here to describe specific differences between the three mechanisms in addition to generalizations. For example, mechanism X had more of motifs 1,2, 4 which suggests more reactions of type Y. It does not need to be too in depth, but just a recap of the notable differences between the mechanisms.

General Comments:

In your method you list out six motifs each with two isomorphisms, one reaction-centered and one species-centered. Did you do any analysis with all twelve of the classes, treating the isomorphisms as separate entities? If not, what is the reasoning for combining these isomorphisms, and could there be anything to gain from separating them out?

Did you try any larger motifs? It might be beyond the scope of this work, but I am curious if there is anything to be gained from 4/5 node motifs.

Because the motifs are small, they are highly interpretable. I would add more language throughout, ties in the motifs to the mechanistic characteristics, such as “more reactions with multiple products” or “more reactions with a species as a reactant and a product”. It can be easily pieced together by the reader as well, but it helps to reiterate the interpretation of the motif distributions.

Have you looked into the ways that scale and use case impact these motif distributions? The superfast mechanism is meant to represent a core set of reactions, how does its distribution reflect that? Do any of the differences between the GeosCHEM and MCM mechanisms arise from their relative sizes, or do you think it has more to do with the way they are made?

Can the motif count be biased by the way a mechanism is written. For example, if you have:

A --> 0.5B + 0.5C, k = 1 in one mechanism and A--> B, k= 0.5, A-->C, k = 0.5, in another mechanism, would this lead to different motif distributions even though the chemistry is more or less the same?

Recommendation: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R0/PR4

Comments

The referees seem pretty happy with your MS, which I am really pleased with as it’s an interesting paper.

Both reviewers agree that the MS could be improved with better explanation of the way in which motifs are connected to the chemistry in the mechanism (e.g. “occurrence of these motifs to physical interpretations or performance metrics would be welcome” & “ I would add more language throughout, ties in the motifs to the mechanistic characteristics, such as “more reactions with multiple products” or “more reactions with a species as a reactant and a product”.” & “ It would be helpful here to describe specific differences between the three mechanisms in addition to generalizations. ”) and I agree with their judgement.

I think the easier it is for someone coming from a more conventional mechanism development or box modelling background to understand how the motifs connect to the chemistry in which they’re interested, the more impact your paper will have so hope these are useful suggestions.

Decision: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R0/PR5

Comments

No accompanying comment.

Author comment: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R1/PR6

Comments

Thank you for the thoughtful reviews, they have improved the manuscript. We have added additional analysis and text that we believe appropriately address the reviewer comments.

Review: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

Thank you for addressing the comments of the reviewers. Section 5 is a useful addition that helps to suggest how the motifs may relate to system behaviour, and the language is appropriate here. I do not have further comments, and I am happy to accept the publication.

Review: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R1/PR8

Conflict of interest statement

no competing interests

Comments

The authors have significantly improved the manuscript with this revision.

Recommendation: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R1/PR9

Comments

Thank you for the additional text connecting the graph approach to the ODE approach. This has really helped and has improved an already fantastic manuscript.

Decision: Graph characterization of higher-order structure in atmospheric chemical reaction mechanisms — R1/PR10

Comments

No accompanying comment.