Skip to main content
Top
Published in: Neural Computing and Applications 2/2018

18-11-2016 | Original Article

Nuclear norm regularized convolutional Max Pos@Top machine

Authors: Qinfeng Li, Xiaofeng Zhou, Aihua Gu, Zonghua Li, Ru-Ze Liang

Published in: Neural Computing and Applications | Issue 2/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we propose a novel classification model for the multiple instance data, which aims to maximize the number of positive instances ranked before the top-ranked negative instances. This method belongs to a recently emerged performance, named as Pos@Top. Our proposed classification model has a convolutional structure that is composed by four layers, i.e., the convolutional layer, the activation layer, the max-pooling layer and the full connection layer. In this paper, we propose an algorithm to learn the convolutional filters and the full connection weights to maximize the Pos@Top measure over the training set. Also, we try to minimize the rank of the filter matrix to explore the low-dimensional space of the instances in conjunction with the classification results. The rank minimization is conducted by the nuclear norm minimization of the filter matrix. In addition, we develop an iterative algorithm to solve the corresponding problem. We test our method on several benchmark datasets. The experimental results show the superiority of our method compared with other state-of-the-art Pos@Top maximization methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M et al (2015) Tensorflow: large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org 1 Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M et al (2015) Tensorflow: large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org 1
2.
go back to reference Agarwal S (2011) The infinite push: a new support vector ranking algorithm that directly optimizes accuracy at the absolute top of the list. In: Proceedings of the 11th SIAM international conference on data mining, SDM 2011, pp 839–850 Agarwal S (2011) The infinite push: a new support vector ranking algorithm that directly optimizes accuracy at the absolute top of the list. In: Proceedings of the 11th SIAM international conference on data mining, SDM 2011, pp 839–850
3.
go back to reference Al Madi, NS, Khan JI (2016) Measuring learning performance and cognitive activity during multimodal comprehension. In: 2016 7th international conference on information and communication systems (ICICS). IEEE, pp 50–55 Al Madi, NS, Khan JI (2016) Measuring learning performance and cognitive activity during multimodal comprehension. In: 2016 7th international conference on information and communication systems (ICICS). IEEE, pp 50–55
4.
go back to reference Bewley A, Upcroft B (2016) From imagenet to mining: adapting visual object detection with minimal supervision. Springer Tracts Adv Robot 113:501–514CrossRef Bewley A, Upcroft B (2016) From imagenet to mining: adapting visual object detection with minimal supervision. Springer Tracts Adv Robot 113:501–514CrossRef
5.
go back to reference Boyd S, Cortes C, Mohri M, Radovanovic A (2012) Accuracy at the top. Adv Neural Inf Process Syst 2:953–961 Boyd S, Cortes C, Mohri M, Radovanovic A (2012) Accuracy at the top. Adv Neural Inf Process Syst 2:953–961
6.
go back to reference Chen S, Wang H, Xu F, Jin YQ (2016) Target classification using the deep convolutional networks for SAR images. IEEE Trans Geosci Remote Sens 54(8):4806–4817CrossRef Chen S, Wang H, Xu F, Jin YQ (2016) Target classification using the deep convolutional networks for SAR images. IEEE Trans Geosci Remote Sens 54(8):4806–4817CrossRef
7.
go back to reference Ding M, Fan G (2013) Multi-layer joint gait-pose manifold for human motion modeling. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8 Ding M, Fan G (2013) Multi-layer joint gait-pose manifold for human motion modeling. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8
8.
go back to reference Ding M, Fan G (2015) Generalized sum of Gaussians for real-time human pose tracking from a single depth sensor. In: 2015 IEEE winter conference on applications of computer vision, pp 47–54 Ding M, Fan G (2015) Generalized sum of Gaussians for real-time human pose tracking from a single depth sensor. In: 2015 IEEE winter conference on applications of computer vision, pp 47–54
9.
go back to reference Ding M, Fan G (2015) Multilayer joint gait-pose manifolds for human gait motion modeling. IEEE Trans Cybern 45(11):2413–2424CrossRef Ding M, Fan G (2015) Multilayer joint gait-pose manifolds for human gait motion modeling. IEEE Trans Cybern 45(11):2413–2424CrossRef
10.
go back to reference Ding M, Fan G (2016) Articulated and generalized Gaussian kernel correlation for human pose estimation. IEEE Trans Image Process 25(2):776–789MathSciNetCrossRef Ding M, Fan G (2016) Articulated and generalized Gaussian kernel correlation for human pose estimation. IEEE Trans Image Process 25(2):776–789MathSciNetCrossRef
11.
go back to reference Fan J, Liang RZ (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance based histogram comparison. Neural Comput Appl 1–11. arXiv:1609.00817v1 Fan J, Liang RZ (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance based histogram comparison. Neural Comput Appl 1–11. arXiv:​1609.​00817v1
12.
go back to reference Goadrich M, Oliphant L, Shavlik J (2006) Gleaner: creating ensembles of first-order clauses to improve recall–precision curves. Mach Learn 64(1–3):231–261CrossRefMATH Goadrich M, Oliphant L, Shavlik J (2006) Gleaner: creating ensembles of first-order clauses to improve recall–precision curves. Mach Learn 64(1–3):231–261CrossRefMATH
13.
go back to reference Gong C, Tao D, Maybank SJ, Liu W, Kang G, Yang J (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans Image Process 25(7):3249–3260MathSciNetCrossRef Gong C, Tao D, Maybank SJ, Liu W, Kang G, Yang J (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans Image Process 25(7):3249–3260MathSciNetCrossRef
14.
go back to reference Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical report 7694, California Institute of Technology Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical report 7694, California Institute of Technology
15.
go back to reference Halligan S, Altman DG, Mallett S (2015) Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. Eur Radiol 25(4):932–939CrossRef Halligan S, Altman DG, Mallett S (2015) Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. Eur Radiol 25(4):932–939CrossRef
16.
go back to reference Hammami N, Bedda M, Farah N (2012) Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE conference on sustainable utilization and development in engineering and technology (STUDENT). IEEE, pp 160–163 Hammami N, Bedda M, Farah N (2012) Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE conference on sustainable utilization and development in engineering and technology (STUDENT). IEEE, pp 160–163
17.
go back to reference Harris G, Panangadan A, Prasanna VK (2015) Learning of performance measures from crowd-sourced data with application to ranking of investments. In: Pacific–Asia conference on knowledge discovery and data mining. Springer, pp 538–549 Harris G, Panangadan A, Prasanna VK (2015) Learning of performance measures from crowd-sourced data with application to ranking of investments. In: Pacific–Asia conference on knowledge discovery and data mining. Springer, pp 538–549
18.
go back to reference He K, Zhang X, Ren S, Sun J (2016) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, 11–18 December 2015, pp 1026–1034 He K, Zhang X, Ren S, Sun J (2016) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, 11–18 December 2015, pp 1026–1034
19.
go back to reference Hendrickx I, Kim SN, Kozareva Z, Nakov P, Ó Séaghdha, D, Padó, S, Pennacchiotti, M, Romano, L, Szpakowicz, S (2009) Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the workshop on semantic evaluations: recent achievements and future directions. Association for Computational Linguistics, pp 94–99 Hendrickx I, Kim SN, Kozareva Z, Nakov P, Ó Séaghdha, D, Padó, S, Pennacchiotti, M, Romano, L, Szpakowicz, S (2009) Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the workshop on semantic evaluations: recent achievements and future directions. Association for Computational Linguistics, pp 94–99
20.
go back to reference Hentschel C, Wiradarma T, Sack H (2015) If we did not have imagenet: comparison of fisher encodings and convolutional neural networks on limited training data. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 9475. Springer, Switzerland, pp 400–409 Hentschel C, Wiradarma T, Sack H (2015) If we did not have imagenet: comparison of fisher encodings and convolutional neural networks on limited training data. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 9475. Springer, Switzerland, pp 400–409
21.
go back to reference Jain S, Kashyap R, Kuo TT, Bhargava S, Lin G, Hsu CN (2016) Weakly supervised learning of biomedical information extraction from curated data. BMC Bioinform 17(1):1CrossRef Jain S, Kashyap R, Kuo TT, Bhargava S, Lin G, Hsu CN (2016) Weakly supervised learning of biomedical information extraction from curated data. BMC Bioinform 17(1):1CrossRef
22.
go back to reference Joachims T (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning. ACM, pp 377–384 Joachims T (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning. ACM, pp 377–384
23.
go back to reference Li N, Jin R, Zhou ZH (2014) Top rank optimization in linear time. Adv Neural Inf Process Syst 2:1502–1510 Li N, Jin R, Zhou ZH (2014) Top rank optimization in linear time. Adv Neural Inf Process Syst 2:1502–1510
24.
go back to reference Liang RZ, Shi L, Wang H, Meng J, Wang JJY, Sun Q, Gu, Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. In: 2016 23st International conference on pattern recognition (ICPR). IEEE Liang RZ, Shi L, Wang H, Meng J, Wang JJY, Sun Q, Gu, Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. In: 2016 23st International conference on pattern recognition (ICPR). IEEE
25.
go back to reference Liang RZ, Xie W, Li W, Wang H, Wang JJY, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. In: 2016 IEEE 28th international conference on tools with artificial intelligence (ICTAI) Liang RZ, Xie W, Li W, Wang H, Wang JJY, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. In: 2016 IEEE 28th international conference on tools with artificial intelligence (ICTAI)
26.
go back to reference Lin F, Wang J, Zhang N, Xiahou J, McDonald N (2016) Multi-kernel learning for multivariate performance measures optimization. Neural Comput Appl 1–13. arXiv:1508.06264v1 Lin F, Wang J, Zhang N, Xiahou J, McDonald N (2016) Multi-kernel learning for multivariate performance measures optimization. Neural Comput Appl 1–13. arXiv:​1508.​06264v1
27.
go back to reference Lu S, Lu H, Kolarik WJ (2001) Multivariate performance reliability prediction in real-time. Reliab Eng Syst Saf 72(1):39–45CrossRef Lu S, Lu H, Kolarik WJ (2001) Multivariate performance reliability prediction in real-time. Reliab Eng Syst Saf 72(1):39–45CrossRef
28.
go back to reference Madsen ME, Konge L, Nørgaard LN, Tabor A, Ringsted C, Klemmensen Å, Ottesen B, Tolsgaard MG (2014) Assessment of performance measures and learning curves for use of a virtual-reality ultrasound simulator in transvaginal ultrasound examination. Ultrasound Obstet Gynecol 44(6):693–699CrossRef Madsen ME, Konge L, Nørgaard LN, Tabor A, Ringsted C, Klemmensen Å, Ottesen B, Tolsgaard MG (2014) Assessment of performance measures and learning curves for use of a virtual-reality ultrasound simulator in transvaginal ultrasound examination. Ultrasound Obstet Gynecol 44(6):693–699CrossRef
29.
go back to reference Mao Q, Tsang IWH (2013) A feature selection method for multivariate performance measures. IEEE Trans Pattern Anal Mach Intell 35(9):2051–2063CrossRef Mao Q, Tsang IWH (2013) A feature selection method for multivariate performance measures. IEEE Trans Pattern Anal Mach Intell 35(9):2051–2063CrossRef
30.
go back to reference Martinel N, Piciarelli C, Micheloni C (2016) A supervised extreme learning committee for food recognition. Comput Vis Image Underst 148:67–86CrossRef Martinel N, Piciarelli C, Micheloni C (2016) A supervised extreme learning committee for food recognition. Comput Vis Image Underst 148:67–86CrossRef
31.
go back to reference Meyen A, Sooriyarachchi M (2016) Simulation study of a novel method for comparing more than two independent receiver operating characteristic (ROC) curves based on the area under the curves (AUCS). J Natl Sci Found Sri Lanka 43(4):357–367CrossRef Meyen A, Sooriyarachchi M (2016) Simulation study of a novel method for comparing more than two independent receiver operating characteristic (ROC) curves based on the area under the curves (AUCS). J Natl Sci Found Sri Lanka 43(4):357–367CrossRef
32.
go back to reference Patel M, Agius S, Wilkinson J, Patel L, Baker P (2016) Value of supervised learning events in predicting doctors in difficulty. Med Educ 50(7):746–756CrossRef Patel M, Agius S, Wilkinson J, Patel L, Baker P (2016) Value of supervised learning events in predicting doctors in difficulty. Med Educ 50(7):746–756CrossRef
33.
go back to reference Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg A, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg A, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef
34.
go back to reference Shih SM, Wu WH, Hsieh HN (2016) A non-inferiority test for diagnostic accuracy in the absence of the golden standard test based on the paired partial areas under receiver operating characteristic curves. J Appl Stat 43(3):550–562MathSciNetCrossRef Shih SM, Wu WH, Hsieh HN (2016) A non-inferiority test for diagnostic accuracy in the absence of the golden standard test based on the paired partial areas under receiver operating characteristic curves. J Appl Stat 43(3):550–562MathSciNetCrossRef
35.
go back to reference Xu S, Xu L, Zhan Z, Ye K, Han K, Born F (2014) Method and system for resilient and adaptive detection of malicious websites. US Patent WO2013184653 A1 Xu S, Xu L, Zhan Z, Ye K, Han K, Born F (2014) Method and system for resilient and adaptive detection of malicious websites. US Patent WO2013184653 A1
36.
go back to reference Sofotasios PC, Fikadu MK, Ho-Van K, Valkama M, Karagiannidis GK (2014) The area under a receiver operating characteristic curve over enriched multipath fading conditions. In: 2014 IEEE global communications conference. IEEE, pp 3490–3495 Sofotasios PC, Fikadu MK, Ho-Van K, Valkama M, Karagiannidis GK (2014) The area under a receiver operating characteristic curve over enriched multipath fading conditions. In: 2014 IEEE global communications conference. IEEE, pp 3490–3495
37.
go back to reference Sun F, Guo J, Lan Y, Xu J, Cheng X (2015) Learning word representations by jointly modeling syntagmatic and paradigmatic relations. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, Beijing, China, 26–31 July 2015, pp 136–145 Sun F, Guo J, Lan Y, Xu J, Cheng X (2015) Learning word representations by jointly modeling syntagmatic and paradigmatic relations. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, Beijing, China, 26–31 July 2015, pp 136–145
38.
go back to reference Takeda A, Kanamori T (2014) Using financial risk measures for analyzing generalization performance of machine learning models. Neural Netw 57:29–38CrossRefMATH Takeda A, Kanamori T (2014) Using financial risk measures for analyzing generalization performance of machine learning models. Neural Netw 57:29–38CrossRefMATH
39.
go back to reference Tang B, Liu X, Lei J, Song M, Tao D, Sun S, Dong F (2016) Deepchart: combining deep convolutional networks and deep belief networks in chart classification. Signal Process 124:156–161CrossRef Tang B, Liu X, Lei J, Song M, Tao D, Sun S, Dong F (2016) Deepchart: combining deep convolutional networks and deep belief networks in chart classification. Signal Process 124:156–161CrossRef
40.
go back to reference Wang CY, Peng DY, Xu L, Yi XS (2007) Gradual gray-watermark embedding algorithm in the wavelet domain. J Comput Appl 6:025 Wang CY, Peng DY, Xu L, Yi XS (2007) Gradual gray-watermark embedding algorithm in the wavelet domain. J Comput Appl 6:025
41.
go back to reference Wang J, Wang H, Zhou Y, McDonald N (2015) Multiple kernel multivariate performance learning using cutting plane algorithm. In: 2015 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 1870–1875 Wang J, Wang H, Zhou Y, McDonald N (2015) Multiple kernel multivariate performance learning using cutting plane algorithm. In: 2015 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 1870–1875
42.
go back to reference Wang JJY, Tsang IWH, Gao X (2016) Optimizing multivariate performance measures from multi-view data. In: Thirtieth AAAI conference on artificial intelligence, pp 2152—2158 Wang JJY, Tsang IWH, Gao X (2016) Optimizing multivariate performance measures from multi-view data. In: Thirtieth AAAI conference on artificial intelligence, pp 2152—2158
43.
go back to reference Wang L, Scott K, Xu L, Clausi D (2016) Sea ice concentration estimation during melt from dual-pol SAR scenes using deep convolutional neural networks: a case study. IEEE Trans Geosci Remote Sens 54(8):4524–4533CrossRef Wang L, Scott K, Xu L, Clausi D (2016) Sea ice concentration estimation during melt from dual-pol SAR scenes using deep convolutional neural networks: a case study. IEEE Trans Geosci Remote Sens 54(8):4524–4533CrossRef
44.
go back to reference Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: Proceedings of the third ACM conference on data and application security and privacy. ACM, pp 141–152 Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: Proceedings of the third ACM conference on data and application security and privacy. ACM, pp 141–152
45.
go back to reference Xu L, Zhan Z, Xu S, Ye K (2014) An evasion and counter-evasion study in malicious websites detection. In: 2014 IEEE conference on communications and network security (CNS). IEEE, pp 265–273 Xu L, Zhan Z, Xu S, Ye K (2014) An evasion and counter-evasion study in malicious websites detection. In: 2014 IEEE conference on communications and network security (CNS). IEEE, pp 265–273
46.
go back to reference Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) Dropsample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten chinese character recognition. Pattern Recogn 58:190–203CrossRef Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) Dropsample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten chinese character recognition. Pattern Recogn 58:190–203CrossRef
47.
go back to reference Zahedi M, Sorkhi A (2013) Improving text classification performance using PCA and recall–precision criteria. Arab J Sci Eng 38(8):2095–2102CrossRef Zahedi M, Sorkhi A (2013) Improving text classification performance using PCA and recall–precision criteria. Arab J Sci Eng 38(8):2095–2102CrossRef
48.
go back to reference Zhang P, Su W (2012) Statistical inference on recall, precision and average precision under random selection. In: Proceedings—2012 9th international conference on fuzzy systems and knowledge discovery, FSKD 2012, pp 1348–1352 Zhang P, Su W (2012) Statistical inference on recall, precision and average precision under random selection. In: Proceedings—2012 9th international conference on fuzzy systems and knowledge discovery, FSKD 2012, pp 1348–1352
49.
go back to reference Zokaei N, Burnett Heyes S, Gorgoraptis N, Budhdeo S, Husain M (2015) Working memory recall precision is a more sensitive index than span. J Neuropsychol 9(2):319–329CrossRef Zokaei N, Burnett Heyes S, Gorgoraptis N, Budhdeo S, Husain M (2015) Working memory recall precision is a more sensitive index than span. J Neuropsychol 9(2):319–329CrossRef
Metadata
Title
Nuclear norm regularized convolutional Max Pos@Top machine
Authors
Qinfeng Li
Xiaofeng Zhou
Aihua Gu
Zonghua Li
Ru-Ze Liang
Publication date
18-11-2016
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 2/2018
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-016-2680-2

Other articles of this Issue 2/2018

Neural Computing and Applications 2/2018 Go to the issue

Premium Partner