Learning to Play Stackelberg Security Games

Avrim Blum; Nika Haghtalab; Ariel D. Procaccia

doi:10.1017/9781316676714.025

25 - Learning to Play Stackelberg Security Games

Published online by Cambridge University Press: 13 December 2017

Avrim Blum ,

Nika Haghtalab and

Ariel D. Procaccia

Edited by

Ali E. Abbas ,

Milind Tambe and

Detlof von Winterfeldt

Show author details

Ali E. Abbas: Affiliation:
University of Southern California
Milind Tambe: Affiliation:
University of Southern California
Detlof von Winterfeldt: Affiliation:
University of Southern California

Book contents

Get access

Summary

A summary is not available for this content so a preview has been provided. Please use the Get access link above for information on how to access this content.

Image of the first page of this content. For PDF version, please use the ‘Save PDF’ preceeding this image.'

Type: Chapter
Information: Improving Homeland Security Decisions , pp. 604 - 626

DOI: https://doi.org/10.1017/9781316676714.025 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2017

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2), 235–256.Google Scholar

Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (2002). The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1), 48–77.CrossRef Google Scholar

Awerbuch, B., & Kleinberg, R. (2008). Online linear optimization and adaptive routing. Journal of Computer and System Sciences, 74(1), 97–114.Google Scholar

Awerbuch, B., & Mansour, Y. (2003). Adapting to a reliable network path. In Proceedings of the 22nd ACM symposium on principles of distributed computing (PODC) (pp. 360–367).Google Scholar

Balcan, M.-F., Blum, A., Haghtalab, N., & Procaccia, A. D. (2015). Commitment without regrets: Online learning in Stackelberg security games. In Proceedings of the 16th ACM Conference on Economics and Computation (EC) (pp. 61–78).Google Scholar

Bertsimas, D., & Vempala, S. (2004). Solving convex programs by random walks. Journal of the ACM, 51(4), 540–556.Google Scholar

Blum, A., Haghtalab, N., & Procaccia, A. D. (2014). Learning optimal commitment to overcome insecurity. In Proceedings of the 28th Annual Conference on Neural Information Processing Systems (NIPS) (pp. 1826–1834).Google Scholar

Blum, A., & Mansour, Y. (2007). Learning, regret minimization, and equilibria. In Nisan, N., Roughgarden, T., Tardos, É., & Vazirani, V. (Eds.), Algorithmic game theory (chapter 4). Cambridge University Press.Google Scholar

Browne, C., Powley, E. J., Whitehouse, D., Lucas, S. M., Cowling, P. I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., & Colton, S. (2012). A survey of Monte Carlo tree search methods. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 1–43, 2012.CrossRef Google Scholar

Cesa-Bianchi, N., Mansour, Y., & Stoltz, G. (2005). Improved second-order bounds for prediction with expert advice. In Proceedings of the 18th Conference on Computational Learning Theory (COLT) (pp. 217–232).Google Scholar

Chakraborty, D., Agmon, N., & Stone, P. (2013). Targeted opponent modeling of memory-bounded agents. In Proceedings of the Adaptive Learning Agents Workshop (ALA) (pp. 211–226).Google Scholar

Conitzer, V., & Sandholm, T. (2006). Computing the optimal strategy to commit to. In Proceedings of the 7th ACM Conference on Economics and Computation (EC) (pp. 82–90).Google Scholar

Grötschel, M., Lovász, L., & Schrijver, A. (1988). Geometric algorithms and combinatorial optimization. Springer.CrossRef Google Scholar

Hannan, J. (1957). Approximation to Bayes risk in repeated plays. In Dresher, M., Tucker, A., & Wolfe, P. (Eds.), Contributions to the theory of games (vol. 3, pp. 97–139). Princeton University Press.Google Scholar

Khachiyan, L. G. (1979). A polynomial algorithm in linear programming. Soviet Mathematics Doklady, 20, 191–194.Google Scholar

Kiekintveld, C., Islam, T., & Kreinovich, V. (2013). Security games with interval uncertainty. In Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (pp. 231–238).Google Scholar

Kiekintveld, C., & Jain, M. (2015). Basic solution concepts and algorithms for Stackelberg security games. In Abbas, A., Tambe, M., & von Winterfeldt, D. (Eds.), Improving homeland security decisions (chapter 17).Google Scholar

Kocsis, L., & Szepesvári, C. (2006). Bandit based Monte-Carlo planning. In Proceedings of the 17th European Conference on Machine Learning (ECML) (pp. 282–293).Google Scholar

Letchford, J., Conitzer, V., & Munagala, K. (2009). Learning and approximating the optimal strategy to commit to. In Proceedings of the 2nd International Symposium on Algorithmic Game Theory (SAGT) (pp. 250–262).Google Scholar

Littlestone, N., & Warmuth, M. K. (1994). The weighted majority algorithm. Information and Computation, pages 212–261.CrossRef Google Scholar

Marecki, J., Tesauro, G., & Segal, R. (2012). Playing repeated Stackelberg games with unknown opponents. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (pp. 821–828).Google Scholar

Nguyen, T. H., Yadav, A., An, B., Tambe, M., & Boutilier, C. (2014). Regret-based optimization and preference elicitation for Stackelberg security games with uncertainty. In Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI) (pp. 756–762).Google Scholar

Nguyen, T. H. Yang, R. Azaria, A. Kraus, S., & Tambe, M. (2013). Analyzing the effectiveness of adversary modeling in security games. In Proceedings of the 27th AAAI Conference on Artificial Intelligence (AAAI) (pp. 718–724).Google Scholar

Tauman Kalai, A., & Vempala, S. (2006). Simulated annealing for convex optimization. Mathematics of Operations Research, 31(2), 253–266.CrossRef Google Scholar

Yang, R., Ford, B. J., Tambe, M., & Lemieux, A. (2014). Adaptive resource allocation for wildlife protection against illegal poachers. In Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (pp. 453–460).Google Scholar

Book contents

25 - Learning to Play Stackelberg Security Games

Summary

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive