On the complexity of All ε-Best Arms Identification

Aymen Al Marjani; Tomas Kocak; Aurélien Garivier

Pré-Publication, Document De Travail Année : 2022

On the complexity of All ε-Best Arms Identification

(1) , (2) , (1)

1
2

Aymen Al Marjani

Fonction : Auteur
PersonId : 1118574

Unité de Mathématiques Pures et Appliquées

Tomas Kocak

Fonction : Auteur
PersonId : 1029739

University of Potsdam = Universität Potsdam

Aurélien Garivier

Fonction : Auteur
PersonId : 4986
IdHAL : aurelien-garivier
ORCID : 0000-0002-4906-9573
IdRef : 111902495

Unité de Mathématiques Pures et Appliquées

Résumé

We consider the problem introduced by [MJTN20] of identifying all the ε-optimal arms in a finite stochastic multi-armed bandit with Gaussian rewards. In the fixed confidence setting, we give a lower bound on the number of samples required by any algorithm that returns the set of ε-good arms with a failure probability less than some risk level δ. This bound writes as T * ε (µ) log(1/δ), where T * ε (µ) is a characteristic time that depends on the vector of mean rewards µ and the accuracy parameter ε. We also provide an efficient numerical method to solve the convex max-min program that defines the characteristic time. Our method is based on a complete characterization of the alternative bandit instances that the optimal sampling strategy needs to rule out, thus making our bound tighter than the one provided by [MJTN20]. Using this method, we propose a Track-and-Stop algorithm that identifies the set of ε-good arms w.h.p and enjoys asymptotic optimality (when δ goes to zero) in terms of the expected sample complexity. Finally, using numerical simulations, we demonstrate our algorithm's advantage over state-of-the-art methods, even for moderate values of the risk parameter.

Domaines

Mathématiques [math]

Fichier principal

All_epsilon_arxiv.pdf (602.87 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Aymen Al Marjani : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03570280

Soumis le : dimanche 13 février 2022-11:59:16

Dernière modification le : lundi 11 mars 2024-14:28:41

Archivage à long terme le : samedi 14 mai 2022-18:11:26

Dates et versions

hal-03570280 , version 1 (13-02-2022)

hal-03570280 , version 2 (20-06-2022)

Identifiants

HAL Id : hal-03570280 , version 1

Citer

Aymen Al Marjani, Tomas Kocak, Aurélien Garivier. On the complexity of All ε-Best Arms Identification. 2022. ⟨hal-03570280v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

84 Consultations

102 Téléchargements

On the complexity of All ε-Best Arms Identification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Partager