Hostname: page-component-5db58dd55d-4jdj6 Total loading time: 0 Render date: 2026-06-03T00:18:18.880Z Has data issue: false hasContentIssue false

Change-Point Detection and Regularization in Time Series Cross-Sectional Data Analysis

Published online by Cambridge University Press:  07 October 2022

Jong Hee Park*
Affiliation:
Department of Political Science and International Relations, IR Data Center, Seoul National University, Seoul, South Korea. E-mail: jongheepark@snu.ac.kr
Soichiro Yamauchi
Affiliation:
Department of Government, Harvard University, Cambridge, MA, USA. E-mail: syamauchi@g.harvard.edu
*
Corresponding author Jong Hee Park
Rights & Permissions [Opens in a new window]

Abstract

Researchers of time series cross-sectional data regularly face the change-point problem, which requires them to discern between significant parametric shifts that can be deemed structural changes and minor parametric shifts that must be considered noise. In this paper, we develop a general Bayesian method for change-point detection in high-dimensional data and present its application in the context of the fixed-effect model. Our proposed method, hidden Markov Bayesian bridge model, jointly estimates high-dimensional regime-specific parameters and hidden regime transitions in a unified way. We apply our method to Alvarez, Garrett, and Lange’s (1991, American Political Science Review 85, 539–556) study of the relationship between government partisanship and economic growth and Allee and Scalera’s (2012, International Organization 66, 243–276) study of membership effects in international organizations. In both applications, we found that the proposed method successfully identify substantively meaningful temporal heterogeneity in parameters of regression models.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s) 2022. Published by Cambridge University Press on behalf of the Society for Political Methodology
Figure 0

Figure 1 Illustration of the change-point problem in regression models with a large number of predictors.

Figure 1

Figure 2 Simulation outcomes from 24 sets of TSCS data. The brown circles indicate the true break numbers. Panel (a) indicates the root-mean-square error of time-varying parameters. Panel (b) is WAIC. A lower WAIC score indicates a good predictive accuracy.

Figure 2

Figure 3 Simulation outcomes from 24 sets of TSCS data. Panel (a) shows recovered hidden states (gray) over true states (black). We jittered hidden state estimates for easy comparison. Panel (b) is a stabilized Gelman–Rubin statistics (Vats and Knudson 2021). The values close to 1 indicate good convergence.

Figure 3

Figure 4 Over-detection (a) and under-detection (b) of hidden states: The top plot in each panel shows the posterior estimates of time-varying parameters, which is computed by $p(\beta_{k,t}|\mathbf{y}) = \sum_{m=1}^{M} p(\beta_{k}, s_t = m |\mathbf{y})$. The bottom plot shows hidden state probabilities ($p(s_{t}|\mathbf{y})$). The data are simulated from $n = 20, t= 60$, and $k=30$.

Figure 4

Table 1 WAIC scores of HMBBs on Alvarez et al. (1991): The estimation is based on 10,000 MCMC runs after discarding the first 10,000 MCMC runs.

Figure 5

Figure 5 Hidden state transitions and time-varying movements of parameters in Alvarez et al.’s (1991) partial interaction model.

Figure 6

Figure 6 Comparison of parameter estimates: Fixed-effects estimates are obtained by the least squares method and the panel robust standard error of MacKinnon and White (1985). HMBB estimates are obtained from a single-break model. The detected break point is between 1978 and 1979.

Figure 7

Table 2 WAIC scores of HMBBs on Allee and Scalera’s (2012) Column (5) model and Column (6) model: The estimation is based on 10,000 MCMC runs after discarding the first 10,000 MCMC runs.

Figure 8

Figure 7 Hidden state transitions and time-varying movements of parameters in Allee and Scalera (2012)

Figure 9

Figure 8 Comparison of parameter estimates based on Allee and Scalera (2012): Fixed-effects estimates are obtained by the least-squares method. HMBB estimates are obtained from a two-break model. The detected break points are 1964 and 2007.

Figure 10

Table 3 Variable selection of HMBB estimates using the DSS method: DSS indicates the sparsified estimates of HMBB outputs. The employed model is a single-break HMBB fitted on the full interaction model of Alvarez et al.1991.

Supplementary material: Link

Park and Yamauchi Dataset

Link
Supplementary material: PDF

Park and Yamauchi supplementary material

Park and Yamauchi supplementary material

Download Park and Yamauchi supplementary material(PDF)
PDF 978.1 KB