Skip to main content
Erschienen in: Neural Computing and Applications 35/2023

14.12.2022 | S.I.: Applications and Techniques in Cyber Intelligence (ATCI2022)

Dual dimensionality reduction on instance-level and feature-level for multi-label data

verfasst von: Haikun Li, Min Fang, Peng Wang

Erschienen in: Neural Computing and Applications | Ausgabe 35/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The training data in multi-label learning are often high dimensional and contains a quantity of noise and redundant information, resulting in high memory overhead and low classification performance during the learning process. Therefore, dimensionality reduction for multi-label data has become an important research topic. Existing dimensionality reduction methods for multi-label data focus on either the instance-level or the feature-level; few studies have achieved both. This paper proposes a novel two-stage method to reduce dimensionality for both instances and features on multi-label data. In the dimensionality reduction stage of instances, the original training data are converted into single-label data utilizing binary relevance. The learning vector quantization technique is employed to perform prototype selection on the transformed data and generate new instance-level low-dimensional multi-label data on the ground of the nearest neighbor information of the selected prototypes. Next, a filter-based feature selection method is proposed to choose discriminative features for each class label in the feature reduction phase. The number of retained features is determined according to the preset proportion parameters to achieve the feature-level dimensionality reduction. Experimental results on seven benchmarks verify the effectiveness of the proposed method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
These benchmark datasets were sourced from: https://​mulan.​sourceforge.​net/​datasets-mlc.​html.
 
Literatur
1.
Zurück zum Zitat Zhang M-L, Zhou Z-H (2006) Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans Knowl Data Eng 18(10):1338–1351CrossRef Zhang M-L, Zhou Z-H (2006) Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans Knowl Data Eng 18(10):1338–1351CrossRef
2.
Zurück zum Zitat Zhang M, Li Y, Liu X, Geng X (2018) Binary relevance for multi-label learning: an overview. Front Comp Sci 12(2):191–202CrossRef Zhang M, Li Y, Liu X, Geng X (2018) Binary relevance for multi-label learning: an overview. Front Comp Sci 12(2):191–202CrossRef
3.
Zurück zum Zitat Boutell MR, Luo J, Shen X, Brown CM (2004) Learning multi-label scene classification. Pattern Recognit 37(9):1757–1771CrossRef Boutell MR, Luo J, Shen X, Brown CM (2004) Learning multi-label scene classification. Pattern Recognit 37(9):1757–1771CrossRef
4.
Zurück zum Zitat Calvo-Zaragoza J, Valero-Mas J, Rico-Juan J (2015) Improving kNN multi-label classification in prototype selection scenarios using class proposals. Pattern Recognit 48(5):1608–1622CrossRef Calvo-Zaragoza J, Valero-Mas J, Rico-Juan J (2015) Improving kNN multi-label classification in prototype selection scenarios using class proposals. Pattern Recognit 48(5):1608–1622CrossRef
5.
Zurück zum Zitat Lin Y, Li Y, Wang C, Chen J (2018) Attribute reduction for multi-label learning with fuzzy rough set. Knowl Based Syst 152:51–61CrossRef Lin Y, Li Y, Wang C, Chen J (2018) Attribute reduction for multi-label learning with fuzzy rough set. Knowl Based Syst 152:51–61CrossRef
6.
Zurück zum Zitat Huang J et al (2019) Improving multi-label classification with missing labels by learning label-specific features. Inf Sci 492:124–146MathSciNetCrossRefMATH Huang J et al (2019) Improving multi-label classification with missing labels by learning label-specific features. Inf Sci 492:124–146MathSciNetCrossRefMATH
7.
Zurück zum Zitat Read J, Pfahringer B, Holmes G, Frank E (2011) Classifier chains for multi-label classification. Mach Learn 85(3):333–359MathSciNetCrossRef Read J, Pfahringer B, Holmes G, Frank E (2011) Classifier chains for multi-label classification. Mach Learn 85(3):333–359MathSciNetCrossRef
8.
Zurück zum Zitat Tsoumakas G, Katakis I, Vlahavas I (2011) Random k-Labelsets for multilabel classification. IEEE Trans Knowl Data Eng 23(7):1079–1089CrossRef Tsoumakas G, Katakis I, Vlahavas I (2011) Random k-Labelsets for multilabel classification. IEEE Trans Knowl Data Eng 23(7):1079–1089CrossRef
9.
Zurück zum Zitat Hüllermeier E, Fürnkranz J, Cheng W, Brinker K (2008) Label ranking by learning pairwise preferences. Artif Intell 172(16–17):1897–1916MathSciNetCrossRefMATH Hüllermeier E, Fürnkranz J, Cheng W, Brinker K (2008) Label ranking by learning pairwise preferences. Artif Intell 172(16–17):1897–1916MathSciNetCrossRefMATH
10.
Zurück zum Zitat Fürnkranz J, Hüllermeier E, Loza Mencía E, Brinker K (2008) Multilabel classification via calibrated label ranking. Mach Learn 73(2):133–153CrossRefMATH Fürnkranz J, Hüllermeier E, Loza Mencía E, Brinker K (2008) Multilabel classification via calibrated label ranking. Mach Learn 73(2):133–153CrossRefMATH
11.
Zurück zum Zitat Zhang M, Zhou Z (2007) ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit 40(7):2038–2048CrossRefMATH Zhang M, Zhou Z (2007) ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit 40(7):2038–2048CrossRefMATH
13.
Zurück zum Zitat Jia X, Zhu S, Li W (2020) Joint label-specific features and correlation information for multi-label learning. J Comput Sci Technol 35(2):247–258CrossRef Jia X, Zhu S, Li W (2020) Joint label-specific features and correlation information for multi-label learning. J Comput Sci Technol 35(2):247–258CrossRef
14.
Zurück zum Zitat Madjarov G, Kocev D, Gjorgjevikj D, Džeroski S (2012) An extensive experimental comparison of methods for multi-label learning. Pattern Recognit 45(9):3084–3104CrossRef Madjarov G, Kocev D, Gjorgjevikj D, Džeroski S (2012) An extensive experimental comparison of methods for multi-label learning. Pattern Recognit 45(9):3084–3104CrossRef
15.
16.
Zurück zum Zitat Charte F, Rivera AJ, Jesus MJ (2014) MLeNN: a first approach to heuristic multilabel undersampling. IDEAL. Springer, pp 1–9 Charte F, Rivera AJ, Jesus MJ (2014) MLeNN: a first approach to heuristic multilabel undersampling. IDEAL. Springer, pp 1–9
17.
Zurück zum Zitat Kanj S, Abdallah F, Denœux T, Tout K (2015) Editing training data for multi-label classification with the k-nearest neighbor rule. Pattern Anal Appl 19(1):145–161MathSciNetCrossRef Kanj S, Abdallah F, Denœux T, Tout K (2015) Editing training data for multi-label classification with the k-nearest neighbor rule. Pattern Anal Appl 19(1):145–161MathSciNetCrossRef
18.
Zurück zum Zitat Arnaiz-González Á, Díez-Pastor J et al (2018) Study of data transformation techniques for adapting single-label prototype selection algorithms to multi-label learning. Expert Syst Appl 109:114–130CrossRef Arnaiz-González Á, Díez-Pastor J et al (2018) Study of data transformation techniques for adapting single-label prototype selection algorithms to multi-label learning. Expert Syst Appl 109:114–130CrossRef
19.
Zurück zum Zitat Lin Y, Hu Q, Liu J, Duan J (2015) Multi-label feature selection based on max-dependency and min-redundancy. Neurocomputing 168:92–103CrossRef Lin Y, Hu Q, Liu J, Duan J (2015) Multi-label feature selection based on max-dependency and min-redundancy. Neurocomputing 168:92–103CrossRef
20.
Zurück zum Zitat Lin Y, Hu Q, Liu J, Chen J, Duan J (2016) Multi-label feature selection based on neighborhood mutual information. Appl Soft Comput 38:244–256CrossRef Lin Y, Hu Q, Liu J, Chen J, Duan J (2016) Multi-label feature selection based on neighborhood mutual information. Appl Soft Comput 38:244–256CrossRef
21.
Zurück zum Zitat Zhang M, Peña J, Robles V (2009) Feature selection for multi-label naive Bayes classification. Inf Sci 179(19):3218–3229CrossRefMATH Zhang M, Peña J, Robles V (2009) Feature selection for multi-label naive Bayes classification. Inf Sci 179(19):3218–3229CrossRefMATH
22.
Zurück zum Zitat Zhang J, Luo Z, Li C, Zhou C, Li S (2019) Manifold regularized discriminative feature selection for multi-label learning. Pattern Recognit 95:136–150CrossRef Zhang J, Luo Z, Li C, Zhou C, Li S (2019) Manifold regularized discriminative feature selection for multi-label learning. Pattern Recognit 95:136–150CrossRef
23.
Zurück zum Zitat Kohonen T (1997) Learning vector quantization. Self-organizing maps. Springer, Berlin, pp 203–217CrossRefMATH Kohonen T (1997) Learning vector quantization. Self-organizing maps. Springer, Berlin, pp 203–217CrossRefMATH
24.
Zurück zum Zitat Huang L, Tang J, Sun D, Luo B (2013) Feature selection algorithm based on multi-label ReliefF. J Comput Appl 32(10):2888–2890 Huang L, Tang J, Sun D, Luo B (2013) Feature selection algorithm based on multi-label ReliefF. J Comput Appl 32(10):2888–2890
Metadaten
Titel
Dual dimensionality reduction on instance-level and feature-level for multi-label data
verfasst von
Haikun Li
Min Fang
Peng Wang
Publikationsdatum
14.12.2022
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 35/2023
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-08117-0

Weitere Artikel der Ausgabe 35/2023

Neural Computing and Applications 35/2023 Zur Ausgabe

S.I.: Applications and Techniques in Cyber Intelligence (ATCI2022)

Computer-aided digital media art creation based on artificial intelligence

S.I. : Applications and Techniques in Cyber Intelligence (ATCI2022)

MapReduce-based distributed tensor clustering algorithm

S.I.: Applications and Techniques in Cyber Intelligence (ATCI2022)

Sentiment recognition and analysis method of official document text based on BERT–SVM model

Premium Partner