Skip to main content

2015 | OriginalPaper | Buchkapitel

Calibrated k-labelsets for Ensemble Multi-label Classification

verfasst von : Ouadie Gharroudi, Haytham Elghazel, Alex Aussem

Erschienen in: Neural Information Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

RAndom k-labELsets (RAkEL) is an effective ensemble multi-label classification (MLC) model where each base-classifier is trained on a small random subset of k labels. However, the model construction does not fully benefit from the diversity of the ensemble and the label probability estimates obtained with RAkEL are usually badly calibrated due to the problems raised by the imbalanced label representation. In this paper, we propose three practical solutions to overcome these drawbacks. One is to increase the diversity of the base classifiers in the ensemble. The second to smooth the label powerset probability estimates during the ensemble aggregation process, and the third to calibrate the label decision thresholds. Experimental results on various benchmark data sets indicate that the proposed approach outperforms significantly recent state-of-the-art MLC algorithms, including RAkEL and its variants.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Briggs, F., Fern, X.Z., Irvine, J.: Multi-label classifier chains for bird sound. In: Workshop on Machine Learning for Bioacoustics, ICML 2013 (2013) Briggs, F., Fern, X.Z., Irvine, J.: Multi-label classifier chains for bird sound. In: Workshop on Machine Learning for Bioacoustics, ICML 2013 (2013)
2.
Zurück zum Zitat Demsar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH Demsar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH
3.
Zurück zum Zitat Fan, R., Lin, C.: A study on threshold selection for multi-label classification. Department of Computer Science, National Taiwan University (2007) Fan, R., Lin, C.: A study on threshold selection for multi-label classification. Department of Computer Science, National Taiwan University (2007)
4.
Zurück zum Zitat Marios, I., George S., Tsoumakas, G., Vlahavas, I.: Obtaining bipartitions from score vectors for multi-label classification. In: ICTAI (2010) Marios, I., George S., Tsoumakas, G., Vlahavas, I.: Obtaining bipartitions from score vectors for multi-label classification. In: ICTAI (2010)
5.
Zurück zum Zitat Lipton, Z.C., Elkan, C., Naryanaswamy, B.: Optimal thresholding of classifiers to maximize F1 measure. In: Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.) ECML PKDD 2014, Part II. LNCS, vol. 8725, pp. 225–239. Springer, Heidelberg (2014) Lipton, Z.C., Elkan, C., Naryanaswamy, B.: Optimal thresholding of classifiers to maximize F1 measure. In: Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.) ECML PKDD 2014, Part II. LNCS, vol. 8725, pp. 225–239. Springer, Heidelberg (2014)
6.
Zurück zum Zitat Madjarov, G., Kocev, D., Gjorgjevikj, D., Džeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recogn. 45, 3084–3104 (2012)CrossRef Madjarov, G., Kocev, D., Gjorgjevikj, D., Džeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recogn. 45, 3084–3104 (2012)CrossRef
7.
Zurück zum Zitat Kouzani, A.Z., Nasierding, G., Tsoumakas, G.: A triple-random ensemble classification method for mining multi-label data. In: Data Mining Workshops, ICDMW 2010 (2010) Kouzani, A.Z., Nasierding, G., Tsoumakas, G.: A triple-random ensemble classification method for mining multi-label data. In: Data Mining Workshops, ICDMW 2010 (2010)
8.
Zurück zum Zitat Pillai, I., Fumera, G., Roli, F.: Threshold optimisation for multi-label classifiers. Pattern Recogn. 46, 2055–2065 (2013)CrossRefMATH Pillai, I., Fumera, G., Roli, F.: Threshold optimisation for multi-label classifiers. Pattern Recogn. 46, 2055–2065 (2013)CrossRefMATH
9.
Zurück zum Zitat Payam, R., Lei, T., Huan, L.: Cross-validation. In: Encyclopedia of Database Systems. Springer, New York (2009) Payam, R., Lei, T., Huan, L.: Cross-validation. In: Encyclopedia of Database Systems. Springer, New York (2009)
10.
Zurück zum Zitat Rokach, L., Schclar, A., Itach, E.: Ensemble methods for multi-label classification. Expert Syst. Appl. 41, 7507–7523 (2014)CrossRef Rokach, L., Schclar, A., Itach, E.: Ensemble methods for multi-label classification. Expert Syst. Appl. 41, 7507–7523 (2014)CrossRef
11.
Zurück zum Zitat Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-labelsets for multilabel classification. IEEE Transa. Knowl. Data Eng. 23, 1079–1089 (2011)CrossRef Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-labelsets for multilabel classification. IEEE Transa. Knowl. Data Eng. 23, 1079–1089 (2011)CrossRef
12.
Zurück zum Zitat Tsoumakas, G., Xioufis, E.S., Vilcek, J., Vlahavas, I.P.: Mulan: a java library for multi-label learning. J. Mach. Learn. Res. 12, 2411–2414 (2011)MathSciNetMATH Tsoumakas, G., Xioufis, E.S., Vilcek, J., Vlahavas, I.P.: Mulan: a java library for multi-label learning. J. Mach. Learn. Res. 12, 2411–2414 (2011)MathSciNetMATH
Metadaten
Titel
Calibrated k-labelsets for Ensemble Multi-label Classification
verfasst von
Ouadie Gharroudi
Haytham Elghazel
Alex Aussem
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-26532-2_63