Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 1/2016

01.02.2016 | Original Article

A risk degree-based safe semi-supervised learning algorithm

verfasst von: Haitao Gan, ZhiZeng Luo, Ming Meng, Yuliang Ma, Qingshan She

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 1/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Semi-supervised learning has attracted much attention in machine learning field over the past decades and a number of algorithms are proposed to improve the performance by exploiting unlabeled data. However, unlabeled data may hurt performance of semi-supervised learning in some cases. It is instinctively expected to design a reasonable strategy to safety exploit unlabeled data. To address the problem, we introduce a safe semi-supervised learning by analyzing the different characteristics of unlabeled data in supervised and semi-supervised learning. Our intuition is that unlabeled data may be often risky in semi-supervised setting and the risk degree are different. Hence, we assign different risk degree to unlabeled data and the risk degree serve as a sieve to determine the exploiting way of unlabeled data. The unlabeled data with high risk should be exploited by supervised learning and the other should be used for semi-supervised learning. In particular, we utilize kernel minimum squared error (KMSE) and Laplacian regularized KMSE for supervised and semi-supervised learning, respectively. Experimental results on several benchmark datasets illustrate the performance of our algorithm is never inferior to that of KMSE and indicate the effectiveness and efficiency of our algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH
2.
Zurück zum Zitat Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the 11th annual conference on computational learning theory. ACM, New York, NY, USA, pp 92–100 Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the 11th annual conference on computational learning theory. ACM, New York, NY, USA, pp 92–100
3.
Zurück zum Zitat Cao Y, He (Helen) H, Huang H (2011) Lift: a new framework of learning from testing data for face recognition. Neurocomputing 74(6):916–929CrossRef Cao Y, He (Helen) H, Huang H (2011) Lift: a new framework of learning from testing data for face recognition. Neurocomputing 74(6):916–929CrossRef
5.
Zurück zum Zitat Chapelle O, Scholkopf B, Zien A (eds) (2006) Semi-supervised learning. MIT Press, Cambridge Chapelle O, Scholkopf B, Zien A (eds) (2006) Semi-supervised learning. MIT Press, Cambridge
6.
Zurück zum Zitat Chen H, Li L, Peng J (2009) Error bounds of multi-graph regularized semi-supervised classification. Inf Sci 179(12):1960–1969MathSciNetCrossRefMATH Chen H, Li L, Peng J (2009) Error bounds of multi-graph regularized semi-supervised classification. Inf Sci 179(12):1960–1969MathSciNetCrossRefMATH
7.
Zurück zum Zitat Chen S, Li S, Su S, Cao D, Ji R (2014) Online semi-supervised compressive coding for robust visual tracking. J Vis Commun Image Rep 25(5):793–804CrossRef Chen S, Li S, Su S, Cao D, Ji R (2014) Online semi-supervised compressive coding for robust visual tracking. J Vis Commun Image Rep 25(5):793–804CrossRef
8.
Zurück zum Zitat Gan H, Sang N, Chen X (2013) Semi-supervised kernel minimum squared error based on manifold structure. In: Proceedings of the 10th international symposium on neural networks, vol 7951. Springer-Verlag, Berlin, Heidelberg, pp 265–272 Gan H, Sang N, Chen X (2013) Semi-supervised kernel minimum squared error based on manifold structure. In: Proceedings of the 10th international symposium on neural networks, vol 7951. Springer-Verlag, Berlin, Heidelberg, pp 265–272
9.
Zurück zum Zitat Gan H, Sang N, Huang R (2014) Self-training-based face recognition using semi-supervised linear discriminant analysis and affinity propagation. J Opt Soc Am A 31(1):1–6CrossRef Gan H, Sang N, Huang R (2014) Self-training-based face recognition using semi-supervised linear discriminant analysis and affinity propagation. J Opt Soc Am A 31(1):1–6CrossRef
10.
Zurück zum Zitat Gan H, Sang N, Huang R, Tong X, Dan Z (2013) Using clustering analysis to improve semi-supervised classification. Neurocomputing 101:290–298CrossRef Gan H, Sang N, Huang R, Tong X, Dan Z (2013) Using clustering analysis to improve semi-supervised classification. Neurocomputing 101:290–298CrossRef
11.
Zurück zum Zitat Grabner Helmut LC, Horst B (2008) Semi-supervised on-line boosting for robust tracking. In: Proceedings of the 10th European conference on computer vision: part I. Springer-Verlag, Berlin, Heidelberg, pp 234–247 Grabner Helmut LC, Horst B (2008) Semi-supervised on-line boosting for robust tracking. In: Proceedings of the 10th European conference on computer vision: part I. Springer-Verlag, Berlin, Heidelberg, pp 234–247
12.
Zurück zum Zitat Joachims T (1999) Transductive inference for text classification using support vector machines. In: Proceedings of the 16th international conference on machine learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp 200–209 Joachims T (1999) Transductive inference for text classification using support vector machines. In: Proceedings of the 16th international conference on machine learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp 200–209
13.
Zurück zum Zitat Li Y, Zhou Z (2011) Improving semi-supervised support vector machines through unlabeled instances selection. In: Proceedings of the 25th AAAI conference on artificial intelligence. AAAI Press, pp 500–505 Li Y, Zhou Z (2011) Improving semi-supervised support vector machines through unlabeled instances selection. In: Proceedings of the 25th AAAI conference on artificial intelligence. AAAI Press, pp 500–505
14.
Zurück zum Zitat Li Y, Zhou Z (2011) Towards making unlabeled data never hurt. In: Proceedings of the 28th international conference on machine learning. Omnipress, pp 1081–1088 Li Y, Zhou Z (2011) Towards making unlabeled data never hurt. In: Proceedings of the 28th international conference on machine learning. Omnipress, pp 1081–1088
15.
Zurück zum Zitat Liu X, Pan S, Hao Z, Lin Z (2014) Graph-based semi-supervised learning by mixed label propagation with a soft constraint. Inf Sci 277:327–337MathSciNetCrossRef Liu X, Pan S, Hao Z, Lin Z (2014) Graph-based semi-supervised learning by mixed label propagation with a soft constraint. Inf Sci 277:327–337MathSciNetCrossRef
16.
Zurück zum Zitat Ni T, Chung FL, Wang S (2015) Support vector machine with manifold regularization and partially labeling privacy protection. Inf Sci 294:390–407MathSciNetCrossRef Ni T, Chung FL, Wang S (2015) Support vector machine with manifold regularization and partially labeling privacy protection. Inf Sci 294:390–407MathSciNetCrossRef
17.
Zurück zum Zitat Qi Z, Xu Y, Wang L, Song Y (2011) Online multiple instance boosting for object detection. Neurocomputing 74(10):1769–1775CrossRef Qi Z, Xu Y, Wang L, Song Y (2011) Online multiple instance boosting for object detection. Neurocomputing 74(10):1769–1775CrossRef
18.
Zurück zum Zitat Van Vaerenbergh S, Santamaria I, Barbano P (2011) Semi-supervised handwritten digit recognition using very few labeled data. In: Proceedings of the 2011 IEEE international conference on acoustics, speech and signal processing, pp 2136–2139 Van Vaerenbergh S, Santamaria I, Barbano P (2011) Semi-supervised handwritten digit recognition using very few labeled data. In: Proceedings of the 2011 IEEE international conference on acoustics, speech and signal processing, pp 2136–2139
19.
Zurück zum Zitat Varadarajan B, Yu D, Deng L, Acero A (2009) Using collective information in semi-supervised learning for speech recognition. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, pp 4633–4636. IEEE Varadarajan B, Yu D, Deng L, Acero A (2009) Using collective information in semi-supervised learning for speech recognition. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, pp 4633–4636. IEEE
20.
Zurück zum Zitat Wang XZ, Dong CR (2009) Improving generalization of fuzzy if-then rules by maximizing fuzzy entropy. IEEE Trans Fuzzy Syst 17(3):556–567CrossRef Wang XZ, Dong CR (2009) Improving generalization of fuzzy if-then rules by maximizing fuzzy entropy. IEEE Trans Fuzzy Syst 17(3):556–567CrossRef
21.
Zurück zum Zitat Wang XZ, Dong LC, Yan JH (2012) Maximum ambiguity-based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505CrossRef Wang XZ, Dong LC, Yan JH (2012) Maximum ambiguity-based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505CrossRef
22.
Zurück zum Zitat Wang XZ, Xing HJ, Li Y, Hua Q, Dong CR, Pedrycz W (2014) A study on relationship between generalization abilities and fuzziness of base classifiers in ensemble learning. IEEE Trans Fuzzy Syst. doi:10.1109/TFUZZ.2014.2371479 Wang XZ, Xing HJ, Li Y, Hua Q, Dong CR, Pedrycz W (2014) A study on relationship between generalization abilities and fuzziness of base classifiers in ensemble learning. IEEE Trans Fuzzy Syst. doi:10.​1109/​TFUZZ.​2014.​2371479
23.
Zurück zum Zitat Wang Y, Chen S (2013) Safety-aware semi-supervised classification. IEEE Trans Neural Netw Learn Syst 24(11):1763–1772CrossRef Wang Y, Chen S (2013) Safety-aware semi-supervised classification. IEEE Trans Neural Netw Learn Syst 24(11):1763–1772CrossRef
24.
Zurück zum Zitat Xu J, Zhang X, Li Y (2001) Kernel mse algorithm: a unified framework for KFD, LS-SVM and KRR. In: Proceedings of international joint conference on neural networks, pp 1486–1491 Xu J, Zhang X, Li Y (2001) Kernel mse algorithm: a unified framework for KFD, LS-SVM and KRR. In: Proceedings of international joint conference on neural networks, pp 1486–1491
25.
Zurück zum Zitat Yang T, Priebe CE (2011) The effect of model misspecification on semi-supervised classification. IEEE Trans Pattern Anal Mach Intell 33(10):2093–2103CrossRef Yang T, Priebe CE (2011) The effect of model misspecification on semi-supervised classification. IEEE Trans Pattern Anal Mach Intell 33(10):2093–2103CrossRef
26.
Zurück zum Zitat Zhu X (2005) Semi-supervised learning literature survey. Technical Report 1530, Computer Sciences, University of Wisconsin-Madison Zhu X (2005) Semi-supervised learning literature survey. Technical Report 1530, Computer Sciences, University of Wisconsin-Madison
Metadaten
Titel
A risk degree-based safe semi-supervised learning algorithm
verfasst von
Haitao Gan
ZhiZeng Luo
Ming Meng
Yuliang Ma
Qingshan She
Publikationsdatum
01.02.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 1/2016
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-015-0416-8

Weitere Artikel der Ausgabe 1/2016

International Journal of Machine Learning and Cybernetics 1/2016 Zur Ausgabe

Neuer Inhalt