Hostname: page-component-77f85d65b8-8wtlm Total loading time: 0 Render date: 2026-03-30T01:43:56.147Z Has data issue: false hasContentIssue false

Bayesian optimization with informative parametric models via sequential Monte Carlo

Published online by Cambridge University Press:  08 March 2022

Rafael Oliveira*
Affiliation:
Brain and Mind Centre, The University of Sydney, Sydney, New South Wales, Australia Data Analytics for Resources and Environments, Australian Research Council, Sydney, New South Wales, Australia
Richard Scalzo
Affiliation:
Data Analytics for Resources and Environments, Australian Research Council, Sydney, New South Wales, Australia School of Mathematics and Statistics, The University of Sydney, Sydney, New South Wales, Australia
Robert Kohn
Affiliation:
Data Analytics for Resources and Environments, Australian Research Council, Sydney, New South Wales, Australia School of Economics, University of New South Wales, Sydney, New South Wales, Australia
Sally Cripps
Affiliation:
Data Analytics for Resources and Environments, Australian Research Council, Sydney, New South Wales, Australia Data61, Commonwealth Scientific and Industrial Research Organisation, Sydney, New South Wales, Australia
Kyle Hardman
Affiliation:
Nomad Atomics, Canberra, Australian Capital Territory, Australia Department of Quantum Science & Technology, Australian National University, Canberra, Australian Capital Territory, Australia
John Close
Affiliation:
Department of Quantum Science & Technology, Australian National University, Canberra, Australian Capital Territory, Australia
Nasrin Taghavi
Affiliation:
School of Engineering and Information Technology, University of New South Wales, Canberra, Australian Capital Territory, Australia
Charles Lemckert
Affiliation:
School of Design and the Built Environment, University of Canberra, Canberra, Australian Capital Territory, Australia
*
*Corresponding author. E-mail: rafael.oliveira@sydney.edu.au

Abstract

Bayesian optimization (BO) has been a successful approach to optimize expensive functions whose prior knowledge can be specified by means of a probabilistic model. Due to their expressiveness and tractable closed-form predictive distributions, Gaussian process (GP) surrogate models have been the default go-to choice when deriving BO frameworks. However, as nonparametric models, GPs offer very little in terms of interpretability and informative power when applied to model complex physical phenomena in scientific applications. In addition, the Gaussian assumption also limits the applicability of GPs to problems where the variables of interest may highly deviate from Gaussianity. In this article, we investigate an alternative modeling framework for BO which makes use of sequential Monte Carlo (SMC) to perform Bayesian inference with parametric models. We propose a BO algorithm to take advantage of SMC’s flexible posterior representations and provide methods to compensate for bias in the approximations and reduce particle degeneracy. Experimental results on simulated engineering applications in detecting water leaks and contaminant source localization are presented showing performance improvements over GP-based BO approaches.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Posterior CDF approximation errors for the exponential-gamma model using $ T=2 $ observations. For each sample size, which corresponds to the number of SMC particles, SMC runs were repeated 400 times for each method, except for the jackknife, which was rerun 40 times due to a longer run time.The theoretical upper confidence bound on the CDF approximation error cn(δ) (Theorem 1) is shown as the plotted blue line. The frequency of violation of the theoretical bounds for i.i.d. empirical CDF errors is also presented on the top of each plot, alongside the target ($ \delta =0.1 $).

Figure 1

Figure 2. Posterior CDF approximation errors for the exponential-gamma model using $ T=5 $ observations. For each sample size, which corresponds to the number of SMC particles, SMC runs were repeated 400 times for each method, except for the jackknife, which was rerun 40 times due to a longer run time.The theoretical upper confidence bound on the CDF approximation error cn(δ) (Theorem 1) is shown as the plotted blue line. The frequency of violation of the theoretical bounds for i.i.d. empirical CDF errors is also presented on the top of each plot, alongside the target ($ \delta =0.1 $).

Figure 2

Figure 3. Linear Gaussian case: (a) mean regret of SMC-UCB for different $ n $ compared to the GP-UCB baseline with parameter dimension $ m:= 10 $; (b) approximation error between the SMC quantile $ {\hat{q}}_t\left({\mathbf{x}}_t,{\delta}_t\right) $ and the true $ {q}_t\left({\mathbf{x}}_t,{\delta}_t\right) $ at SMC-UCB’s selected query point $ {\mathbf{x}}_t $ for different $ n $ settings (absent values correspond to cases where $ {\hat{q}}_t\left(\mathbf{x},{\delta}_t\right)=\infty $); (c) comparison with the non-UCB, GP-based expected improvement algorithm; and (d) effect of parameter dimension $ m $ on optimization performance when compared to the median performance of the GP optimization baselines. All results were averaged over 10 runs. The shaded areas correspond to $ \pm 1 $ standard deviation.

Figure 3

Figure 4. Pipe simulation diagram: A water pipe of 2 in diameter is buried 3 underground in a large block of soil ($ 100\times 100\times 50\;\mathrm{m} $).

Figure 4

Figure 5. Performance results for water leak detection experiment: (a) The gravity objective function generated by CFD simulations and the mean regret curves for each algorithm. The shaded areas in the plot correspond $ \pm 1 $ standard deviation results from results which were averaged over 10 trials. (b) The gravity estimates according to the final SMC and GP posteriors after 100 iterations. (c) SMC estimates for the parameters concerning the location of the leak. The upper plot in (c) is colored according to an estimate for the mass of leaked water.

Figure 5

Figure 6. Contaminant source localization problem: (a) The optimization performance of each algorithm in terms of regret. Results were averaged over 10 runs. (b) A final SMC estimate for the source location, while the true location is marked as a red star.

Submit a response

Comments

No Comments have been published for this article.