Hostname: page-component-8448b6f56d-cfpbc Total loading time: 0 Render date: 2024-04-18T21:35:23.763Z Has data issue: false hasContentIssue false

Complete expected improvement converges to an optimal budget allocation

Published online by Cambridge University Press:  22 July 2019

Ye Chen*
Affiliation:
Virginia Common wealth University
Ilya O. Ryzhov*
Affiliation:
University of Maryland
*
*Postal address: Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA 23284, USA.
**Postal address: Robert H. Smith School of Business, University of Maryland, College Park, MD 20742, USA. Email address: iryzhov@rhsmith.umd.edu

Abstract

The ranking and selection problem is a well-known mathematical framework for the formal study of optimal information collection. Expected improvement (EI) is a leading algorithmic approach to this problem; the practical benefits of EI have repeatedly been demonstrated in the literature, especially in the widely studied setting of Gaussian sampling distributions. However, it was recently proved that some of the most well-known EI-type methods achieve suboptimal convergence rates. We investigate a recently proposed variant of EI (known as ‘complete EI’) and prove that, with some minor modifications, it can be made to converge to the rate-optimal static budget allocation without requiring any tuning.

Type
Original Article
Copyright
© Applied Probability Trust 2019 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Bechhofer, R. E. (1954). A single-sample multiple decision procedure for ranking means of normal populations with known variances. Ann. Math. Statist. 25, 1639.Google Scholar
Branke, J., Chick, S.E. and Schmidt, C. (2007). Selecting a selection procedure. Manag. Sci. 53, 19161932.CrossRefGoogle Scholar
Chau, M., FU, M.C., QU, H. and Ryzhov, I.O. (2014). Simulation optimization: A tutorial overview and recent developments in gradient-based methods. In Proc. 2014 Winter Simulation Conference, eds A. Tolk et al., IEEE, Piscataway, NJ, pp. 2135.CrossRefGoogle Scholar
Chen, C.-H., LIN, J., YüCesan, E. and Chick, S. E. (2000). Simulation budget allocation for further enhancing the efficiency of ordinal optimization. Discrete Event Dynamic Systems 10, 251270.CrossRefGoogle Scholar
Chen, Y. and Ryzhov, I. O. (2017). Rate-optimality of the complete expected improvement criterion. In Proc. 2017 Winter Simulation Conference, eds Chan, W.K.V. et al., IEEE, Piscataway, NJ, pp. 21732182.CrossRefGoogle Scholar
Chick, S.E., Branke, J. and Schmidt, C. (2010). Sequential sampling to myopically maximize the expected value of information. INFORMS J. Computing 22, 7180.Google Scholar
Degroot, M.H. (1970). Optimal Statistical Decisions . John Wiley, Hoboken.Google Scholar
Gittins, J., Glazebrook, K. and Weber, R. (2011). Multi-Armed Bandit Allocation Indices, 2nd edn. John Wiley, Chichester.Google Scholar
Glynn, P.W. and Juneja, S. (2004). A large deviations perspective on ordinal optimization. In Proc. 2004 Winter Simulation Conference, eds R. Ingalls et al., IEEE, pp. 577585.Google Scholar
HAN, B., Ryzhov, I.O. and Defourny, B. (2016). Optimal learning in linear regression with combinatorial feature selection. INFORMS J. Computing 28, 721735.CrossRefGoogle Scholar
Hong, L.J. and Nelson, B. L. (2009). A brief introduction to optimization via simulation. In Proc. 2009 Winter Simulation Conference, eds Rosetti, M. et al., IEEE, pp. 7585.CrossRefGoogle Scholar
Hunter, S.R. and Mc Closky, B. (2016). Maximizing quantitative traits in the mating design problem via simulation-based Pareto estimation. IIE Trans. 48, 565578.CrossRefGoogle Scholar
Jones, D.R., Schonlau, M. and Welch, W. J. (1998). Efficient global optimization of expensive black-box functions. J. Global Optimization 13, 455492.CrossRefGoogle Scholar
Kim, S.-H. and Nelson, B. L. (2001). A fully sequential procedure for indifference-zone selection in simulation. ACM Trans. Model. Comput. Simul. 11, 251273.CrossRefGoogle Scholar
Pasupathy, R. et al. (2014). Stochastically constrained ranking and selection via SCORE. ACM Trans. Model. Comput. Simul. 25, 26p.CrossRefGoogle Scholar
Peng, Y. and FU, M. C. (2017). Myopic allocation policy with asymptotically optimal sampling rate. IEEE Trans. Automatic Control 62, 20412047.CrossRefGoogle Scholar
Powell, W.B. and Ryzhov, I. O. (2012). Optimal Learning . John Wiley, Hoboken.CrossRefGoogle Scholar
Qin, C., Klabjan, D. and Russo, D. (2017). Improving the expected improvement algorithm. In Advances in Neural Information Processing Systems, Vol. 30, eds I. Guyon et al., Neural Information Processing Systems, pp. 53815391.Google Scholar
Ruben, H. (1962). A new asymptotic expansion for the normal probability integral and Mill’s ratio. J. R. Statist. Soc. B 24, 177179.Google Scholar
Russo, D. (2017). Simple Bayesian algorithms for best arm identification. Preprint. Available at https://arxiv.org/abs/1602.08448.Google Scholar
Russo, D. and Van Roy, B. (2014). Learning to optimize via posterior sampling. Math. Operat. Res. 39, 12211243.CrossRefGoogle Scholar
Ryzhov, I. O. (2016). On the convergence rates of expected improvement methods. Operat. Res. 64, 15151528.Google Scholar
Salemi, P., Nelson, B.L. and Staum, J. (2014). Discrete optimization via simulation using Gaussian Markov random fields. In Proc. 2014 Winter Simulation Conference, eds Tolk, A. et al., IEEE, Piscataway, NJ, pp. 38093820.CrossRefGoogle Scholar
Scott, W.R., Powell, W.B. And Simão, H. P. (2010). Calibrating simulation models using the knowledge gradient with continuous parameters. In Proc. 2010 Winter Simulation Conference, eds Johansson, B. et al., IEEE, Piscataway, NJ, pp. 10991109.CrossRefGoogle Scholar