Skip to main content

2016 | OriginalPaper | Buchkapitel

Co-training with Credal Models

verfasst von : Yann Soullard, Sébastien Destercke, Indira Thouvenin

Erschienen in: Artificial Neural Networks in Pattern Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

So-called credal classifiers offer an interesting approach when the reliability or robustness of predictions have to be guaranteed. Through the use of convex probability sets, they can select multiple classes as prediction when information is insufficient and predict a unique class only when the available information is rich enough. The goal of this paper is to explore whether this particular feature can be used advantageously in the setting of co-training, in which a classifier strengthen another one by feeding it with new labeled data. We propose several co-training strategies to exploit the potential indeterminacy of credal classifiers and test them on several UCI datasets. We then compare the best strategy to the standard co-training process to check its efficiency.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We use capital letter to denote the fact that the returned prediction may be a set of classes.
 
2
We use \(40\,\%\) of them as labeled instances to get a compromise between having a large part of data as unlabeled and having a sufficiently large labeled dataset to reduce the sampling bias.
 
Literatur
1.
Zurück zum Zitat Amini, M., Usunier, N.: Learning with Partially Labeled and Interdependent Data. Springer, Switzerland (2015)CrossRefMATH Amini, M., Usunier, N.: Learning with Partially Labeled and Interdependent Data. Springer, Switzerland (2015)CrossRefMATH
2.
Zurück zum Zitat Antonucci, A., Corani, G., Gabaglio, S.: Active learning by the naive credal classifier. In: Sixth European Workshop on Probabilistic Graphical Models (PGM 2012), pp. 3–10 (2012) Antonucci, A., Corani, G., Gabaglio, S.: Active learning by the naive credal classifier. In: Sixth European Workshop on Probabilistic Graphical Models (PGM 2012), pp. 3–10 (2012)
3.
Zurück zum Zitat Augustin, T., Coolen, F.P., de Cooman, G., Troffaes, M.C.: Introduction to Imprecise Probabilities. Wiley, Chichester (2014)CrossRefMATH Augustin, T., Coolen, F.P., de Cooman, G., Troffaes, M.C.: Introduction to Imprecise Probabilities. Wiley, Chichester (2014)CrossRefMATH
4.
Zurück zum Zitat Balcan, M.F., Blum, A., Yang, K.: Co-training and expansion: towards bridging theory and practice (2004) Balcan, M.F., Blum, A., Yang, K.: Co-training and expansion: towards bridging theory and practice (2004)
5.
Zurück zum Zitat Bernard, J.M.: An introduction to the imprecise Dirichlet model for multinomial data. Int. J. Approx. Reason. 39(2), 123–150 (2005)MathSciNetCrossRefMATH Bernard, J.M.: An introduction to the imprecise Dirichlet model for multinomial data. Int. J. Approx. Reason. 39(2), 123–150 (2005)MathSciNetCrossRefMATH
6.
Zurück zum Zitat Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Eleventh Annual Conference on Computational Learning Theory, COLT 1998, pp. 92–100. ACM (1998) Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Eleventh Annual Conference on Computational Learning Theory, COLT 1998, pp. 92–100. ACM (1998)
7.
Zurück zum Zitat Chapelle, O., Schlkopf, B., Zien, A.: Semi-supervised Learning. MIT Press, Cambridge (2006)CrossRef Chapelle, O., Schlkopf, B., Zien, A.: Semi-supervised Learning. MIT Press, Cambridge (2006)CrossRef
8.
Zurück zum Zitat Corani, G., Zaffalon, M.: Credal model averaging: an extension of Bayesian model averaging to imprecise probabilities. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part I. LNCS (LNAI), vol. 5211, pp. 257–271. Springer, Heidelberg (2008)CrossRef Corani, G., Zaffalon, M.: Credal model averaging: an extension of Bayesian model averaging to imprecise probabilities. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part I. LNCS (LNAI), vol. 5211, pp. 257–271. Springer, Heidelberg (2008)CrossRef
9.
Zurück zum Zitat Corani, G., Zaffalon, M.: Learning reliable classifiers from small or incomplete data sets: the naive credal classifier 2. J. Mach. Learn. Res. 9, 581–621 (2008)MathSciNetMATH Corani, G., Zaffalon, M.: Learning reliable classifiers from small or incomplete data sets: the naive credal classifier 2. J. Mach. Learn. Res. 9, 581–621 (2008)MathSciNetMATH
10.
Zurück zum Zitat Corani, G., Zaffalon, M.: Naive credal classifier 2: an extension of naive bayes for delivering robust classifications. DMIN 8, 84–90 (2008). CSREA Press Corani, G., Zaffalon, M.: Naive credal classifier 2: an extension of naive bayes for delivering robust classifications. DMIN 8, 84–90 (2008). CSREA Press
11.
Zurück zum Zitat Dasgupta, S., Littman, M.L., McAllester, D.A.: PAC generalization bounds for co-training. In: Dietterich, T., Becker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems, vol. 14, pp. 375–382. MIT Press, Cambridge (2002) Dasgupta, S., Littman, M.L., McAllester, D.A.: PAC generalization bounds for co-training. In: Dietterich, T., Becker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems, vol. 14, pp. 375–382. MIT Press, Cambridge (2002)
12.
Zurück zum Zitat Destercke, S., Quost, B.: Combining binary classifiers with imprecise probabilities. In: Tang, Y., Huynh, V.-N., Lawry, J. (eds.) IUKM 2011. LNCS, vol. 7027, pp. 219–230. Springer, Heidelberg (2011)CrossRef Destercke, S., Quost, B.: Combining binary classifiers with imprecise probabilities. In: Tang, Y., Huynh, V.-N., Lawry, J. (eds.) IUKM 2011. LNCS, vol. 7027, pp. 219–230. Springer, Heidelberg (2011)CrossRef
13.
Zurück zum Zitat Grandvalet, Y., Bengio, Y.: Semi-supervised learning by entropy minimization. Network 17(5), 529–536 (2005) Grandvalet, Y., Bengio, Y.: Semi-supervised learning by entropy minimization. Network 17(5), 529–536 (2005)
14.
Zurück zum Zitat Kingma, D.P., Rezende, D.J., Mohamed, S., Welling, M.: Semi-supervised learning with deep generative models. CoRR abs/1406.5298 (2014) Kingma, D.P., Rezende, D.J., Mohamed, S., Welling, M.: Semi-supervised learning with deep generative models. CoRR abs/1406.5298 (2014)
15.
Zurück zum Zitat Liu, A., Reyzin, L., Ziebart, B.D.: Shift-pessimistic active learning using robust bias-aware prediction, pp. 1–7 (2015) Liu, A., Reyzin, L., Ziebart, B.D.: Shift-pessimistic active learning using robust bias-aware prediction, pp. 1–7 (2015)
16.
Zurück zum Zitat Nigam, K., McCallum, A., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39(2/3), 103–134 (2000)CrossRefMATH Nigam, K., McCallum, A., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39(2/3), 103–134 (2000)CrossRefMATH
17.
Zurück zum Zitat Qi, R.H., Yang, D.L., Li, H.F.: A two-stage semi-supervised weighted naive credal classification model. Innov. Comput. Inf. Control J. 5(2), 503–508 (2011) Qi, R.H., Yang, D.L., Li, H.F.: A two-stage semi-supervised weighted naive credal classification model. Innov. Comput. Inf. Control J. 5(2), 503–508 (2011)
18.
Zurück zum Zitat Settles, B.: Active Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan and Claypool Publishers, San Rafael (2012)MATH Settles, B.: Active Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan and Claypool Publishers, San Rafael (2012)MATH
19.
Zurück zum Zitat Soullard, Y., Saveski, M., Artieres, T.: Joint semi-supervised learning of hidden conditional random fields and hidden Markov models. Pattern Recogn. Lett. (PRL) 37, 161–171 (2013)CrossRef Soullard, Y., Saveski, M., Artieres, T.: Joint semi-supervised learning of hidden conditional random fields and hidden Markov models. Pattern Recogn. Lett. (PRL) 37, 161–171 (2013)CrossRef
20.
Zurück zum Zitat Troffaes, M.C.: Decision making under uncertainty using imprecise probabilities. Int. J. Approx. Reason. 45(1), 17–29 (2007)MathSciNetCrossRefMATH Troffaes, M.C.: Decision making under uncertainty using imprecise probabilities. Int. J. Approx. Reason. 45(1), 17–29 (2007)MathSciNetCrossRefMATH
21.
Zurück zum Zitat Utkin, L.V.: The imprecise Dirichlet model as a basis for a new boosting classification algorithm. Neurocomputing 151, 1374–1383 (2015)CrossRef Utkin, L.V.: The imprecise Dirichlet model as a basis for a new boosting classification algorithm. Neurocomputing 151, 1374–1383 (2015)CrossRef
22.
Zurück zum Zitat Wang, W., Zhou, Z.-H.: Analyzing co-training style algorithms. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 454–465. Springer, Heidelberg (2007)CrossRef Wang, W., Zhou, Z.-H.: Analyzing co-training style algorithms. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 454–465. Springer, Heidelberg (2007)CrossRef
23.
Zurück zum Zitat Wang, W., Zhou, Z.H.: A new analysis of co-training. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), pp. 1135–1142 (2010) Wang, W., Zhou, Z.H.: A new analysis of co-training. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), pp. 1135–1142 (2010)
24.
Zurück zum Zitat Wang, W., Zhou, Z.H.: Co-training with insufficient views. In: Asian Conference on Machine Learning, pp. 467–482 (2013) Wang, W., Zhou, Z.H.: Co-training with insufficient views. In: Asian Conference on Machine Learning, pp. 467–482 (2013)
25.
Zurück zum Zitat Zaffalon, M.: The naive credal classifier. J. Stat. Plan. Inference 105(1), 5–21 (2002). Imprecise Probability Models and their ApplicationsMathSciNetCrossRefMATH Zaffalon, M.: The naive credal classifier. J. Stat. Plan. Inference 105(1), 5–21 (2002). Imprecise Probability Models and their ApplicationsMathSciNetCrossRefMATH
26.
Zurück zum Zitat Zhang, M.L., Zhou, Z.H.: Cotrade: confident co-training with data editing. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 41(6), 1612–1626 (2011)CrossRef Zhang, M.L., Zhou, Z.H.: Cotrade: confident co-training with data editing. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 41(6), 1612–1626 (2011)CrossRef
27.
Zurück zum Zitat Zhou, Z.H.: When semi-supervised learning meets ensemble learning. Front. Electr. Electron. Eng. China 6(1), 6–16 (2011)CrossRef Zhou, Z.H.: When semi-supervised learning meets ensemble learning. Front. Electr. Electron. Eng. China 6(1), 6–16 (2011)CrossRef
28.
Zurück zum Zitat Zhu, X., Goldberg, A.B., Brachman, R., Dietterich, T.: Introduction to Semi-Supervised Learning. Morgan and Claypool Publishers, San Francisco (2009) Zhu, X., Goldberg, A.B., Brachman, R., Dietterich, T.: Introduction to Semi-Supervised Learning. Morgan and Claypool Publishers, San Francisco (2009)
Metadaten
Titel
Co-training with Credal Models
verfasst von
Yann Soullard
Sébastien Destercke
Indira Thouvenin
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46182-3_8