Skip to main content
Top

2021 | OriginalPaper | Chapter

Classification Bandits: Classification Using Expected Rewards as Imperfect Discriminators

Authors : Koji Tabata, Atsuyoshi Nakumura, Tamiki Komatsuzaki

Published in: Trends and Applications in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A classification bandits problem is a new class of multi-armed bandits problems in which an agent must classify a given set of arms into positive or negative depending on whether the number of bad arms are at least \(N_2\) or at most \(N_1(<N_2)\) by drawing as fewer arms as possible. In our problem setting, bad arms are imperfectly characterized as the arms with above-threshold expected rewards (losses). We develop a method of reducing classification bandits to simpler one threshold classification bandits and propose an algorithm for the problem that classifies a given set of arms correctly with a specified confidence. Our numerical experiments demonstrate effectiveness of our proposed method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Histopathologists usually diagnose whether cells are of cancer or not by inspecting their morphological characteristics with a human bias, but Raman measurements are considered to enable more reliably to judge the cell states.
 
Literature
1.
go back to reference Audibert, J.Y., Bubeck, S.: Best arm identification in multi-armed bandits (2010) Audibert, J.Y., Bubeck, S.: Best arm identification in multi-armed bandits (2010)
2.
go back to reference Gabillon, V., Ghavamzadeh, M., Lazaric, A.: Best arm identification: a unified approach to fixed budget and fixed confidence. In: Advances in Neural Information Processing Systems, pp. 3212–3220 (2012) Gabillon, V., Ghavamzadeh, M., Lazaric, A.: Best arm identification: a unified approach to fixed budget and fixed confidence. In: Advances in Neural Information Processing Systems, pp. 3212–3220 (2012)
3.
go back to reference Kalyanakrishnan, S., Tewari, A., Auer, P., Stone, P.: Pac subset selection in stochastic multi-armed bandits. In: ICML, vol. 12, pp. 655–662 (2012) Kalyanakrishnan, S., Tewari, A., Auer, P., Stone, P.: Pac subset selection in stochastic multi-armed bandits. In: ICML, vol. 12, pp. 655–662 (2012)
5.
go back to reference Kaufmann, E., Koolen, W.M., Garivier, A.: Sequential test for the lowest mean: from Thompson to murphy sampling. In: Advances in Neural Information Processing Systems, pp. 6332–6342 (2018) Kaufmann, E., Koolen, W.M., Garivier, A.: Sequential test for the lowest mean: from Thompson to murphy sampling. In: Advances in Neural Information Processing Systems, pp. 6332–6342 (2018)
6.
go back to reference Locatelli, A., Gutzeit, M., Carpentier, A.: An optimal algorithm for the thresholding bandit problem. In: Proceedings of the 33rd International Conference on International Conference on Machine Learning, vol. 48, pp. 1690–1698 (2016) Locatelli, A., Gutzeit, M., Carpentier, A.: An optimal algorithm for the thresholding bandit problem. In: Proceedings of the 33rd International Conference on International Conference on Machine Learning, vol. 48, pp. 1690–1698 (2016)
7.
go back to reference Nachlas, J.A., Loney, S.R., Binney, B.A.: Diagnostic-strategy selection for series systems. IEEE Trans. Reliab. 39(3), 273–280 (1990)CrossRef Nachlas, J.A., Loney, S.R., Binney, B.A.: Diagnostic-strategy selection for series systems. IEEE Trans. Reliab. 39(3), 273–280 (1990)CrossRef
8.
9.
go back to reference Raghavan, V., Shakeri, M., Pattipati, K.: Test sequencing algorithms with unreliable tests. IEEE Trans. Syst. Man Cybern.-Part A: Syst. Humans 29(4), 347–357 (1999)CrossRef Raghavan, V., Shakeri, M., Pattipati, K.: Test sequencing algorithms with unreliable tests. IEEE Trans. Syst. Man Cybern.-Part A: Syst. Humans 29(4), 347–357 (1999)CrossRef
Metadata
Title
Classification Bandits: Classification Using Expected Rewards as Imperfect Discriminators
Authors
Koji Tabata
Atsuyoshi Nakumura
Tamiki Komatsuzaki
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-75015-2_6

Premium Partner