Crossref Citations
                  
                    
                    
                      
                        This article has been cited by the following publications. This list is generated based on data provided by 
    Crossref.
                     
                   
                  
                        
                          
                                
                                
                                    
                                    Andradóttir, Sigrún
                                  1996.
                                  A Global Search Method for Discrete Stochastic Optimization.
                                  
                                  
                                  SIAM Journal on Optimization, 
                                  Vol. 6, 
                                  Issue. 2, 
                                
                                    p. 
                                    513.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Auer, P.
                                  2000.
                                  Using upper confidence bounds for online learning.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    270.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Cesa-Bianchi, Nicolò
                                  2002.
                                  MULTIARMED BANDITS IN THE WORST CASE.
                                  
                                  
                                  IFAC Proceedings Volumes, 
                                  Vol. 35, 
                                  Issue. 1, 
                                
                                    p. 
                                    91.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Auer, Peter
                                    
                                    Cesa-Bianchi, Nicolò
                                     and 
                                    Fischer, Paul
                                  2002.
                                  Finite-time Analysis of the Multiarmed Bandit Problem.
                                  
                                  
                                  Machine Learning, 
                                  Vol. 47, 
                                  Issue. 2-3, 
                                
                                    p. 
                                    235.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Chang, Hyeong Soo
                                    
                                    Fu, Michael C.
                                    
                                    Hu, Jiaqiao
                                     and 
                                    Marcus, Steven I.
                                  2005.
                                  An Adaptive Sampling Algorithm for Solving Markov Decision Processes.
                                  
                                  
                                  Operations Research, 
                                  Vol. 53, 
                                  Issue. 1, 
                                
                                    p. 
                                    126.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Andradóttir, Sigrún
                                  2006.
                                  Simulation optimization with countably infinite feasible regions.
                                  
                                  
                                  ACM Transactions on Modeling and Computer Simulation, 
                                  Vol. 16, 
                                  Issue. 4, 
                                
                                    p. 
                                    357.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Pandey, Sandeep
                                    
                                    Chakrabarti, Deepayan
                                     and 
                                    Agarwal, Deepak
                                  2007.
                                  Multi-armed bandit problems with dependent arms.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    721.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Audibert, Jean-Yves
                                    
                                    Munos, Rémi
                                     and 
                                    Szepesvári, Csaba
                                  2007.
                                  Algorithmic Learning Theory.
                                  
                                  
                                  
                                  Vol. 4754, 
                                  Issue. , 
                                
                                    p. 
                                    150.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Alaya-Feki, Afef Ben Hadj
                                    
                                    Moulines, Eric
                                     and 
                                    LeCornec, Alain
                                  2008.
                                  Dynamic spectrum access with non-stationary Multi-Armed Bandit.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    416.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Mersereau, A.J.
                                    
                                    Rusmevichientong, P.
                                     and 
                                    Tsitsiklis, J.N.
                                  2009.
                                  A Structured Multiarmed Bandit Problem and the Greedy Policy.
                                  
                                  
                                  IEEE Transactions on Automatic Control, 
                                  Vol. 54, 
                                  Issue. 12, 
                                
                                    p. 
                                    2787.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Jouini, Wassim
                                    
                                    Ernst, Damien
                                    
                                    Moy, Christophe
                                     and 
                                    Palicot, Jacques
                                  2009.
                                  Multi-armed bandit based policies for cognitive radio's decision making issues.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    1.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Audibert, Jean-Yves
                                    
                                    Munos, Rémi
                                     and 
                                    Szepesvári, Csaba
                                  2009.
                                  Exploration–exploitation tradeoff using variance estimates in multi-armed bandits.
                                  
                                  
                                  Theoretical Computer Science, 
                                  Vol. 410, 
                                  Issue. 19, 
                                
                                    p. 
                                    1876.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Auer, Peter
                                     and 
                                    Ortner, Ronald
                                  2010.
                                  UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem.
                                  
                                  
                                  Periodica Mathematica Hungarica, 
                                  Vol. 61, 
                                  Issue. 1-2, 
                                
                                    p. 
                                    55.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Scott, Steven L.
                                  2010.
                                  A modern Bayesian look at the multi‐armed bandit.
                                  
                                  
                                  Applied Stochastic Models in Business and Industry, 
                                  Vol. 26, 
                                  Issue. 6, 
                                
                                    p. 
                                    639.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Liu, Keqin
                                     and 
                                    Zhao, Qing
                                  2010.
                                  Decentralized multi-armed bandit with multiple distributed players.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    1.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Anandkumar, Animashree
                                    
                                    Michael, Nithin
                                     and 
                                    Tang, Ao
                                  2010.
                                  Opportunistic Spectrum Access with Multiple Users: Learning under Competition.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    1.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Li, Lihong
                                    
                                    Chu, Wei
                                    
                                    Langford, John
                                     and 
                                    Schapire, Robert E.
                                  2010.
                                  A contextual-bandit approach to personalized news article recommendation.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    661.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Gai, Yi
                                    
                                    Krishnamachari, Bhaskar
                                     and 
                                    Jain, Rahul
                                  2010.
                                  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation.
                                  
                                  
                                  
                                  
                                  
                                
                                    p. 
                                    1.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Rusmevichientong, Paat
                                     and 
                                    Tsitsiklis, John N.
                                  2010.
                                  Linearly Parameterized Bandits.
                                  
                                  
                                  Mathematics of Operations Research, 
                                  Vol. 35, 
                                  Issue. 2, 
                                
                                    p. 
                                    395.
                                
                                
                        
                        
                        
                        
      
                          
                                
                                
                                    
                                    Liu, Keqin
                                     and 
                                    Zhao, Qing
                                  2010.
                                  Distributed Learning in Multi-Armed Bandit With Multiple Players.
                                  
                                  
                                  IEEE Transactions on Signal Processing, 
                                  Vol. 58, 
                                  Issue. 11, 
                                
                                    p. 
                                    5667.