Skip to main content
Top
Published in: Neural Processing Letters 1/2017

28-04-2016

Hierarchical Multilabel Classification with Optimal Path Prediction

Authors: Zhengya Sun, Yangyang Zhao, Dong Cao, Hongwei Hao

Published in: Neural Processing Letters | Issue 1/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We consider multilabel classification problems where the labels are arranged hierarchically in a tree or directed acyclic graph (DAG). In this context, it is of much interest to select a well-connected subset of nodes which best preserve the label dependencies according to the learned models. Top-down or bottom-up procedures for labelling the nodes in the hierarchy have recently been proposed, but they rely largely on pairwise interactions, thus susceptible to get stuck in local optima. In this paper, we remedy this problem by directly finding a small number of label paths that can cover the desired subgraph in a tree/DAG. To estimate the high-dimensional label vector, we adopt the advantages of partial least squares techniques which perform simultaneous projections of the feature and label space, while constructing sound linear models between them. We then show that the optimal label prediction problem with hierarchy constraints can be reasonably transformed into the optimal path prediction problem with the structured sparsity penalties. The introduction of path selection models further allows us to leverage the efficient network flow solvers with polynomial time complexity. The experimental results validate the promising performance of the proposed algorithm in comparison to the state-of-the-art algorithms on both tree- and DAG-structured data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Barros RC, Cerri R, Freitas AA, de Carvalho ACPLF (2013) Probabilistic clustering for hierarchical multi-label classification of protein functions. In: Machine learning and knowledge discovery in databases, proceedings, part II, pp 385–400 Barros RC, Cerri R, Freitas AA, de Carvalho ACPLF (2013) Probabilistic clustering for hierarchical multi-label classification of protein functions. In: Machine learning and knowledge discovery in databases, proceedings, part II, pp 385–400
2.
go back to reference Barutcuoglu Z, Schapire RE, Troyanskaya OG (2006) Hierarchical multi-label prediction of gene function. Bioinformatics 22(7):830–836CrossRef Barutcuoglu Z, Schapire RE, Troyanskaya OG (2006) Hierarchical multi-label prediction of gene function. Bioinformatics 22(7):830–836CrossRef
3.
go back to reference Bi W, Kwok JT (2011) Multi-label classification on tree- and dag-structured hierarchies. In: Proceedings of the 28th international conference on machine learning, pp 17–24 Bi W, Kwok JT (2011) Multi-label classification on tree- and dag-structured hierarchies. In: Proceedings of the 28th international conference on machine learning, pp 17–24
4.
go back to reference Bi W, Kwok JT (2012) Hierarchical multilabel classification with minimum bayes risk. In: Proceedings of the 12th IEEE international conference on data mining, pp 101–110 Bi W, Kwok JT (2012) Hierarchical multilabel classification with minimum bayes risk. In: Proceedings of the 12th IEEE international conference on data mining, pp 101–110
5.
go back to reference Bi W, Kwok JT (2014) Mandatory leaf node prediction in hierarchical multilabel classification. IEEE Trans Neural Netw Learn Syst 25(12):2275–2287CrossRef Bi W, Kwok JT (2014) Mandatory leaf node prediction in hierarchical multilabel classification. IEEE Trans Neural Netw Learn Syst 25(12):2275–2287CrossRef
6.
go back to reference Blockeel H, Schietgat L, Struyf J, Džeroski S, Clare A (2006) Decision trees for hierarchical multilabel classification: a case study in functional genomics. In: Proceedings of the 10th European conference on principles of data mining and knowledge discovery, pp 18–29 Blockeel H, Schietgat L, Struyf J, Džeroski S, Clare A (2006) Decision trees for hierarchical multilabel classification: a case study in functional genomics. In: Proceedings of the 10th European conference on principles of data mining and knowledge discovery, pp 18–29
7.
go back to reference Cerri R, Barros RC, de Carvalho ACPLF (2011) Hierarchical multi-label classification for protein function prediction: a local approach based on neural networks. In: Intelligent systems design and applications, pp 337–343 Cerri R, Barros RC, de Carvalho ACPLF (2011) Hierarchical multi-label classification for protein function prediction: a local approach based on neural networks. In: Intelligent systems design and applications, pp 337–343
8.
go back to reference Cerri R, Barros RC, de Carvalho ACPLF (2014) Hierarchical multi-label classification using local neural networks. J Comput Syst Sci 80:39–56MathSciNetCrossRefMATH Cerri R, Barros RC, de Carvalho ACPLF (2014) Hierarchical multi-label classification using local neural networks. J Comput Syst Sci 80:39–56MathSciNetCrossRefMATH
9.
go back to reference Cerri R, Barros RC, de Carvalho ACPLF (2015) Hierarchical classification of gene ontology-based protein functions with neural networks. In Proceedings of the 2015 international joint conference on neural networks, pp 1–8 Cerri R, Barros RC, de Carvalho ACPLF (2015) Hierarchical classification of gene ontology-based protein functions with neural networks. In Proceedings of the 2015 international joint conference on neural networks, pp 1–8
10.
go back to reference Cesa-bianchi N, Zaniboni L, Collins M (2004) Incremental algorithms for hierarchical classification. J Mach Learn Res 7:31–54MathSciNetMATH Cesa-bianchi N, Zaniboni L, Collins M (2004) Incremental algorithms for hierarchical classification. J Mach Learn Res 7:31–54MathSciNetMATH
11.
go back to reference Cesa-bianchi N, Gentile C, Zaniboni L (2006) Hierarchical classification: combining bayes with SVM. In: Proceedings of the 23rd international conference on machine learning, pp 177–184 Cesa-bianchi N, Gentile C, Zaniboni L (2006) Hierarchical classification: combining bayes with SVM. In: Proceedings of the 23rd international conference on machine learning, pp 177–184
12.
go back to reference Clare A (2003) Machine learning and data mining for yeast functional genomics. Ph.D. Thesis, University of Wales, Aberystwyth Clare A (2003) Machine learning and data mining for yeast functional genomics. Ph.D. Thesis, University of Wales, Aberystwyth
13.
go back to reference Grauman K, Sha F, Hwang SJ (2011) Learning a tree of metrics with disjoint visual features. In: Advances in neural information processing systems 24, pp 621–629 Grauman K, Sha F, Hwang SJ (2011) Learning a tree of metrics with disjoint visual features. In: Advances in neural information processing systems 24, pp 621–629
14.
go back to reference Hariharan B, Zelnik-Manor L, Vishwanathan SVN, Varma M (2010) Large scale max-margin multi-label classification with priors. In: Proceedings of the 27th international conference on machine learning, pp 423–430 Hariharan B, Zelnik-Manor L, Vishwanathan SVN, Varma M (2010) Large scale max-margin multi-label classification with priors. In: Proceedings of the 27th international conference on machine learning, pp 423–430
15.
go back to reference Hernandez J, Sucar LE, Morales EF (2013) A hybrid global-local approach for hierarchical classification. In: Proceedings of the twenty-sixty international Florida artificial intelligence research society conference, pp 432–437 Hernandez J, Sucar LE, Morales EF (2013) A hybrid global-local approach for hierarchical classification. In: Proceedings of the twenty-sixty international Florida artificial intelligence research society conference, pp 432–437
16.
go back to reference Kiritchenko S, Matwin S, Famili AF (2004) Hierarchical text categorization as a tool of associating genes with gene ontology codes. In: European workshop on data mining and text mining in bioinformatics, pp 30–34 Kiritchenko S, Matwin S, Famili AF (2004) Hierarchical text categorization as a tool of associating genes with gene ontology codes. In: European workshop on data mining and text mining in bioinformatics, pp 30–34
17.
go back to reference Ramírez-Corona M, Sucar LE, Morales EF (2014) Chained path evaluation for hierarchical multi-label classification. In Proceedings of the twenty-seventh international Florida artificial intelligence research society conference, pp 502–507 Ramírez-Corona M, Sucar LE, Morales EF (2014) Chained path evaluation for hierarchical multi-label classification. In Proceedings of the twenty-seventh international Florida artificial intelligence research society conference, pp 502–507
18.
go back to reference Rosipal R, Krämer N (2006) Overview and recent advances in partial least squares. In: Subspace, latent structure and feature selection techniques, pp 34–51 Rosipal R, Krämer N (2006) Overview and recent advances in partial least squares. In: Subspace, latent structure and feature selection techniques, pp 34–51
19.
go back to reference Rousu J, Saunders C, Szedmák S, Shawe-Taylor J (2006) Kernel-based learning of hierarchical multilabel classification models. J Mach Learn Res 7:1601–1626MathSciNetMATH Rousu J, Saunders C, Szedmák S, Shawe-Taylor J (2006) Kernel-based learning of hierarchical multilabel classification models. J Mach Learn Res 7:1601–1626MathSciNetMATH
20.
go back to reference Silla CN Jr, Freitas AA (2011) A survey of hierarchical classification across different application domains. Data Min Knowl Disc 22(1–2):31–72MathSciNetCrossRefMATH Silla CN Jr, Freitas AA (2011) A survey of hierarchical classification across different application domains. Data Min Knowl Disc 22(1–2):31–72MathSciNetCrossRefMATH
21.
go back to reference Vens C, Struyf J, Schietgat L, Džeroski S, Blockeel H (2008) Decision trees for hierarchical multi-label classification. Mach Learn 73(2):185–214CrossRef Vens C, Struyf J, Schietgat L, Džeroski S, Blockeel H (2008) Decision trees for hierarchical multi-label classification. Mach Learn 73(2):185–214CrossRef
22.
go back to reference Wang P, Zhang P, Guo L (2012) Mining multi-label data streams using ensemble-based active learning. In: Proceedings of the 12th SIAM international conference on data mining, pp 1131–1140 Wang P, Zhang P, Guo L (2012) Mining multi-label data streams using ensemble-based active learning. In: Proceedings of the 12th SIAM international conference on data mining, pp 1131–1140
23.
go back to reference Wold H (1975) Path models with latent variables: the nipals approach. In: Quantitative sociology: international perspectives on mathematical and statistical model building, pp 307–357 Wold H (1975) Path models with latent variables: the nipals approach. In: Quantitative sociology: international perspectives on mathematical and statistical model building, pp 307–357
24.
go back to reference Wold S, Martens H, Wold H (1983) The multivariate calibration problem in chemistry solved by the pls method. In: Matrix pencils, pp 286–293 Wold S, Martens H, Wold H (1983) The multivariate calibration problem in chemistry solved by the pls method. In: Matrix pencils, pp 286–293
25.
go back to reference Zhou D, Xiao L, Wu M (2011) Hierarchical classification via orthogonal transfer. In: Proceedings of the 28th international conference on machine learning, pp 801–808 Zhou D, Xiao L, Wu M (2011) Hierarchical classification via orthogonal transfer. In: Proceedings of the 28th international conference on machine learning, pp 801–808
Metadata
Title
Hierarchical Multilabel Classification with Optimal Path Prediction
Authors
Zhengya Sun
Yangyang Zhao
Dong Cao
Hongwei Hao
Publication date
28-04-2016
Publisher
Springer US
Published in
Neural Processing Letters / Issue 1/2017
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-016-9526-x

Other articles of this Issue 1/2017

Neural Processing Letters 1/2017 Go to the issue