Hostname: page-component-77c78cf97d-9dm9z Total loading time: 0 Render date: 2026-05-04T23:11:04.992Z Has data issue: false hasContentIssue false

Accelerating parallel tempering: Quantile tempering algorithm (QuanTA)

Published online by Cambridge University Press:  03 September 2019

Nicholas G. Tawn*
Affiliation:
University of Warwick
Gareth O. Roberts*
Affiliation:
University of Warwick
*
*Postal address: Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK.
*Postal address: Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK.
Rights & Permissions [Opens in a new window]

Abstract

It is well known that traditional Markov chain Monte Carlo (MCMC) methods can fail to effectively explore the state space for multimodal problems. Parallel tempering is a well-established population approach for such target distributions involving a collection of particles indexed by temperature. However, this method can suffer dramatically from the curse of dimensionality. In this paper we introduce an improvement on parallel tempering called QuanTA. A comprehensive theoretical analysis quantifying the improved efficiency and scalability of the approach is given. Under weak regularity conditions, QuanTA gives accelerated mixing through the temperature space. Empirical evidence of the effectiveness of this new algorithm is illustrated on canonical examples.

Information

Type
Original Article
Copyright
© Applied Probability Trust 2019 
Figure 0

Figure 1: Trace plots of the target state chains for representative runs of the parallel tempering (top) and QuanTA schemes (bottom).

Figure 1

Figure 2: Plots of a simple bimodal Gaussian mixture distribution at two temperature levels $\beta_i$ and $\beta_j$. The metric $m(\cdot,\cdot)$ that allocates locations to mode points is chosen to be the Euclidean metric on $\mathbb{R}$. The shaded regions represent $A_{ij}$ and $A_{ji}$ in the top and bottom plots, respectively.

Figure 2

Figure 3: The (non-normalised) tempered target distributions for (21) for inverse temperatures $\Delta=\{1,0.0002,0.0002^2 \},$ respectively.

Figure 3

Table 1: Comparison of the acceptance rates of swap moves for the PT algorithm and QuanTA targeting the one-dimensional distribution given in (21) and setup with the ambitious inverse temperature schedule given by $\Delta=\{ 1,0.0002,0.0002^2\}$.

Figure 4

Figure 4: For the target given in (21), the running weight approximations for the mode centred on 200 with target weight $w_5=0.2$ for 10 separate runs of the PT and QuanTA schemes. Left: the PT runs showing slow and variable estimates for $w_5$. Right: the new QuanTA scheme showing fast, unbiased convergence to the true value for $w_5$.

Figure 5

Figure 5: Trace plots of the first component of the twenty dimensional cold state chains for representative runs of the PT (top) and new QuanTA (bottom) schemes. Note the fast inter-modal mixing of the new QuanTA scheme, allowing rapid exploration of the target distribution. In contrast the PT scheme never escapes the initialising mode.

Figure 6

Table 2: Comparison of the acceptance rates of swap moves for the PT algorithm and QuanTA targeting the twenty-dimensional distribution given in (22) and setup with the ambitious inverse temperature schedule given by $\{ 1,0.002,0.002^2 ,0.002^3 \}$.

Figure 7

Figure 6: Trace plots of the first component of the five dimensional cold state chains for representative runs of the PT and QuanTA schemes respectively. Note the difference in inter-modal mixing between the QuanTA scheme and the PT scheme which struggles to escape the initialisation mode.

Figure 8

Table 3: Comparison of the acceptance rates of swap moves for the PT and new QuanTA algorithms targeting the five-dimensional distribution given in (23) and setup with the ambitious inverse temperature schedule given in (24). Note that, for QuanTA, the reparametrised swap move was only used for swaps in the coldest seven levels.

Figure 9

Table 4: Complexity comparisons between QuanTA and PT for the one-dimensional example.

Figure 10

Table 5: Complexity comparisons between QuanTA and PT for the twenty-dimensional example.