Skip to main content
Top

2015 | OriginalPaper | Chapter

Hybrid Approach for Inductive Semi Supervised Learning Using Label Propagation and Support Vector Machine

Authors : Aruna Govada, Pravin Joshi, Sahil Mittal, Sanjay K. Sahay

Published in: Machine Learning and Data Mining in Pattern Recognition

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Semi supervised learning methods have gained importance in today’s world because of large expenses and time involved in labeling the unlabeled data by human experts. The proposed hybrid approach uses SVM and Label Propagation to label the unlabeled data. In the process, at each step SVM is trained to minimize the error and thus improve the prediction quality. Experiments are conducted by using SVM and logistic regression(Logreg). Results prove that SVM performs tremendously better than Logreg. The approach is tested using 12 datasets of different sizes ranging from the order of 1000s to the order of 10000s. Results show that the proposed approach outperforms Label Propagation by a large margin with F-measure of almost twice on average. The parallel version of the proposed approach is also designed and implemented, the analysis shows that the training time decreases significantly when parallel version is used.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Castelli, V., Cover, T.: The exponential value of labeled samples. Pattern Recogn. Lett. 16, 105–111 (1995)CrossRef Castelli, V., Cover, T.: The exponential value of labeled samples. Pattern Recogn. Lett. 16, 105–111 (1995)CrossRef
2.
go back to reference Castelli, V., Cover, T.: The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter. IEEE Trans. Inf. Theory 42, 2101–2117 (1996)MathSciNetCrossRef Castelli, V., Cover, T.: The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter. IEEE Trans. Inf. Theory 42, 2101–2117 (1996)MathSciNetCrossRef
3.
go back to reference Ratsaby, J., Venkatesh, S.: Learning from a mixture of labeled and unlabeled examples with parametric side information. In: Proceedings of the Eighth Annual Conference on Computational Learning Theory, pp. 412–417 (1995) Ratsaby, J., Venkatesh, S.: Learning from a mixture of labeled and unlabeled examples with parametric side information. In: Proceedings of the Eighth Annual Conference on Computational Learning Theory, pp. 412–417 (1995)
4.
go back to reference Cozman, F.G., Cohen, I., Cirelo, M.C.: Semi-supervised learning of mixture models. In: ICML (2003) Cozman, F.G., Cohen, I., Cirelo, M.C.: Semi-supervised learning of mixture models. In: ICML (2003)
5.
go back to reference Corduneanu, A., Jaakkola, T.: Stable mixing of complete and incomplete information, Technical report AIM-2001-030, MIT AI Memo (2001) Corduneanu, A., Jaakkola, T.: Stable mixing of complete and incomplete information, Technical report AIM-2001-030, MIT AI Memo (2001)
6.
go back to reference Callison-Burch, C., Talbot, D., Osborne, M.: Statistical machine translation with word- and sentence-aligned parallel corpora. In: Proceedings of the ACL (2004) Callison-Burch, C., Talbot, D., Osborne, M.: Statistical machine translation with word- and sentence-aligned parallel corpora. In: Proceedings of the ACL (2004)
7.
go back to reference Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)MATHCrossRef Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)MATHCrossRef
8.
go back to reference Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B, 1–38 (1977) Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B, 1–38 (1977)
9.
go back to reference Bennett, K., Demiriz, A.: Semi-supervised support vector machines. Adv. Neural Inf. Proc. Syst. 11, 368–374 (1999) Bennett, K., Demiriz, A.: Semi-supervised support vector machines. Adv. Neural Inf. Proc. Syst. 11, 368–374 (1999)
10.
go back to reference Dara, R., Kremer, S., Stacey, D.: Clustering unlabeled data with SOMs improves classification of labeled real-world data. In: Proceedings of the World Congress on Computational Intelligence (WCCI) (2002) Dara, R., Kremer, S., Stacey, D.: Clustering unlabeled data with SOMs improves classification of labeled real-world data. In: Proceedings of the World Congress on Computational Intelligence (WCCI) (2002)
11.
go back to reference Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pp. 189–196 (1995) Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pp. 189–196 (1995)
12.
go back to reference Riloff, E., Wiebe, J., Wilson, T.: Learning subjective nouns using extraction pattern bootstrapping. In: Proceedings of the Seventh Conference on Natural Language Learning (CoNLL-2003) (2003) Riloff, E., Wiebe, J., Wilson, T.: Learning subjective nouns using extraction pattern bootstrapping. In: Proceedings of the Seventh Conference on Natural Language Learning (CoNLL-2003) (2003)
13.
go back to reference Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self training of object detection models. In: Seventh IEEE Workshop on Applications of Computer Vision (2005) Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self training of object detection models. In: Seventh IEEE Workshop on Applications of Computer Vision (2005)
14.
go back to reference Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory, COLT (1998) Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory, COLT (1998)
15.
go back to reference Balcan, M.-F., Blum, A.: An augmented PAC model for semi-supervised learning. In: Chapelle, O., Sch Olkopf, B., Zien, A. (eds.) Semi-supervised learning. MIT Press, Cambridge (2006) Balcan, M.-F., Blum, A.: An augmented PAC model for semi-supervised learning. In: Chapelle, O., Sch Olkopf, B., Zien, A. (eds.) Semi-supervised learning. MIT Press, Cambridge (2006)
16.
go back to reference Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. Technical report CMU-CALD-02-107, Carnegie Mellon University (2002) Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. Technical report CMU-CALD-02-107, Carnegie Mellon University (2002)
17.
go back to reference Wang, F., Zhang, C.: Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Eng. 20(1), 55–67 (2008)CrossRef Wang, F., Zhang, C.: Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Eng. 20(1), 55–67 (2008)CrossRef
18.
go back to reference Zhu, X., Lafferty, J., Rosenfeld, R.: Semi-supervised learning with graphs. Diss. Carnegie Mellon University, Language Technologies Institute, School of Computer Science (2005) Zhu, X., Lafferty, J., Rosenfeld, R.: Semi-supervised learning with graphs. Diss. Carnegie Mellon University, Language Technologies Institute, School of Computer Science (2005)
19.
go back to reference Hearst, M.A., et al.: Support vector machines. IEEE Intel. Syst. Appl. 13(4), 18–28 (1998)CrossRef Hearst, M.A., et al.: Support vector machines. IEEE Intel. Syst. Appl. 13(4), 18–28 (1998)CrossRef
Metadata
Title
Hybrid Approach for Inductive Semi Supervised Learning Using Label Propagation and Support Vector Machine
Authors
Aruna Govada
Pravin Joshi
Sahil Mittal
Sanjay K. Sahay
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-21024-7_14

Premium Partner