Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-06T04:07:38.419Z Has data issue: false hasContentIssue false

Exact recovery of Granger causality graphs with unconditional pairwise tests

Published online by Cambridge University Press:  06 June 2023

R. J. Kinnear
Affiliation:
Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada
R. R. Mazumdar*
Affiliation:
Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada
*
Corresponding author: R. R. Mazumdar; Email: mazum@uwaterloo.ca
Rights & Permissions [Opens in a new window]

Abstract

We study Granger Causality in the context of wide-sense stationary time series. The focus of the analysis is to understand how the underlying topological structure of the causality graph affects graph recovery by means of the pairwise testing heuristic. Our main theoretical result establishes a sufficient condition (in particular, the graph must satisfy a polytree assumption we refer to as strong causality) under which the graph can be recovered by means of unconditional and binary pairwise causality testing. Examples from the gene regulatory network literature are provided which establish that graphs which are strongly causal, or very nearly so, can be expected to arise in practice. We implement finite sample heuristics derived from our theory, and use simulation to compare our pairwise testing heuristic against LASSO-based methods. These simulations show that, for graphs which are strongly causal (or small perturbations thereof) the pairwise testing heuristic is able to more accurately recover the underlying graph. We show that the algorithm is scalable to graphs with thousands of nodes, and that, as long as structural assumptions are met, exhibits similar high-dimensional scaling properties as the LASSO. That is, performance degrades slowly while the system size increases and the number of available samples is held fixed. Finally, a proof-of-concept application example shows, by attempting to classify alcoholic individuals using only Granger causality graphs inferred from EEG measurements, that the inferred Granger causality graph topology carries identifiable features.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. Examples illustrating the difficulty of obtaining a converse to Proposition 2.4. (a) Path cancelation: $j \in{\mathcal{A}({i})} \nRightarrow j \overset{\text{PW}}{\rightarrow } i$. (b) Cancellation from time lag: $j \in{pa(i)} \nRightarrow j \overset{\text{PW}}{\rightarrow } i$.

Figure 1

Figure 2. Reproduced under the creative commons attribution non-commercial license (http://creativecommons.org/licenses/by-nc/2.5) is a known gene regulatory network for a certain strain of E.Coli. The graph has only a single edge violating strong causality.

Figure 2

Figure 3. Reproduced under the creative commons attribution license (https://creativecommons.org/licenses/by/4.0/) is a much larger gene regulatory network. It exhibits similar qualitative structure (a network of hub nodes) as does the much smaller network of Figure 2.

Figure 3

Algorithm 1: Pairwise Granger Causality Algorithm

Figure 4

Figure 4. Black arrows indicate true parent-child relations. Red dotted arrows indicate pairwise causality (due to non-parent relations), green dash-dotted arrows indicate bidirectional pairwise causality (due to the confounding node $1$). Blue groupings indicate each $P_k$ in Algorithm 1.

Figure 5

Figure 5. Representative random graph topologies on $N = 50$ nodes.

Figure 6

Table 1. Simulation results—PWGC vs AdaLASSO

Figure 7

Figure 6. Measures support recovery performance as the number of nodes $N$ increases, and the edge proportion as well as the number of samples $T$ is held fixed. Remarkably, the degradation as $N$ increases is limited, it is primarily the graph topology (SCG or non-SCG) as well as the level of sparsity (measured by $q$) which are the determining factors for support recovery performance.

Figure 8

Figure 7. Provides a support recovery comparison for very small values of, $T$ typical for many applications. The system has a fixed number $N=50$ nodes.

Figure 9

Figure 8. Is a low-dimensional embedding representing the classification of alcoholic and non-alcoholic subjects based entirely on the Granger causality graphs constructed from EEG signals. Figure 8b further illustrates a classification task using EEG causality graphs. In this case, one particular subject can be almost perfectly separated from another particular subject with a binary classifier. These results indicate the causality graph carries consistent and identifiable cross-subject patterns.

Supplementary material: File

Kinnear and Mazumdar supplementary material

Kinnear and Mazumdar supplementary material 1

Download Kinnear and Mazumdar supplementary material(File)
File 18 KB
Supplementary material: PDF

Kinnear and Mazumdar supplementary material

Kinnear and Mazumdar supplementary material 2

Download Kinnear and Mazumdar supplementary material(PDF)
PDF 437.1 KB