Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T02:51:15.791Z Has data issue: false hasContentIssue false

Improved Metropolis–Hastings algorithms via landscape modification with applications to simulated annealing and the Curie–Weiss model

Published online by Cambridge University Press:  30 August 2023

Michael C. H. Choi*
Affiliation:
National University of Singapore
*
*Postal address: Department of Statistics and Data Science, National University of Singapore, Singapore. Email address: mchchoi@nus.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

In this paper, we propose new Metropolis–Hastings and simulated annealing algorithms on a finite state space via modifying the energy landscape. The core idea of landscape modification rests on introducing a parameter c, such that the landscape is modified once the algorithm is above this threshold parameter to encourage exploration, while the original landscape is utilized when the algorithm is below the threshold for exploitation purposes. We illustrate the power and benefits of landscape modification by investigating its effect on the classical Curie–Weiss model with Glauber dynamics and external magnetic field in the subcritical regime. This leads to a landscape-modified mean-field equation, and with appropriate choice of c the free energy landscape can be transformed from a double-well into a single-well landscape, while the location of the global minimum is preserved on the modified landscape. Consequently, running algorithms on the modified landscape can improve the convergence to the ground state in the Curie–Weiss model. In the setting of simulated annealing, we demonstrate that landscape modification can yield improved or even subexponential mean tunnelling time between global minima in the low-temperature regime by appropriate choice of c, and we give a convergence guarantee using an improved logarithmic cooling schedule with reduced critical height. We also discuss connections between landscape modification and other acceleration techniques, such as Catoni’s energy transformation algorithm, preconditioning, importance sampling, and quantum annealing. The technique developed in this paper is not limited to simulated annealing, but is broadly applicable to any difference-based discrete optimization algorithm by a change of landscape.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of Applied Probability Trust
Figure 0

Figure 1. Landscape of $\frac{1}{\epsilon} \mathcal{H}$ and $\mathcal{H}^f_{\epsilon,c} + \frac{1}{\epsilon} \mathcal{H}_{\textrm{min}}$, where $\mathcal{H}(x) =\cos\! (2 x)+\frac{1}{2} \sin\! (x)+\frac{1}{3} \sin\! (10 x)$, $\epsilon \in \{0.25,0.5,0.75,1\}$, $c = -1.5$, and $f(z) = z$.

Figure 1

Figure 2. Plots of the free energy $g_{\epsilon}$ and the modified free energy $g^{f}_{\epsilon,c} + E_{min}$ with $h = -0.05$ and $f(z) = z$ at temperature $\epsilon = 1/1.5$, where $E_{min} = \min_{m \in [-1,1]} E(m)$. We shift the modified free energy by $E_{min}$ so that it is on the same scale as the original free energy $g_{\epsilon}$.

Figure 2

Figure 3. Plots of the mean-field equation (3.1) and the modified mean-field equation (3.4) with $h = -0.05$ and $f(z) = z$ at temperature $\epsilon = 1/1.5$.

Figure 3

Figure 4. Comparing the critical heights $H^0, H^f_{1,c}, c^*$ on a one-dimensional landscape generated by $\mathcal{H}$ with a local minimum at m and a global minimum, where we take $\epsilon = 1$, $f(z) = z$, and the proposal chain is of nearest-neighbour type, i.e. going left or right with probability $1/2$.

Figure 4

Table 1. Summary statistics of $\textrm{IP}$ on 1000 random TSP instances.

Figure 5

Figure 5. Histogram showing IP of ISA over SA on 1000 randomly generated TSP instances.

Figure 6

Figure 6. TSP objective value against iteration of ISA and SA.