2019 | OriginalPaper | Chapter

Cost Sensitive Learning in the Presence of Symmetric Label Noise

Authors: Sandhya Tripathi, Nandyala Hemachandra

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Abstract

In the binary classification framework, we are interested in making cost sensitive label predictions in the presence of uniform (symmetric) label noise. We first observe that 0–1 Bayes classifiers are not robust to uniform noise in the cost sensitive setting. To circumvent this impossibility result, we present two schemes; unlike the existing methods, our schemes do not require the noise rate. The first scheme uses the \(\alpha\)-weighted \(\gamma\)-uneven margin squared loss function, \(l_{\alpha, usq}\), which can handle cost sensitivity arising from a domain requirement (using a user-given \(\alpha\)), from class imbalance (by tuning \(\gamma\)), or from both. However, we observe that \(l_{\alpha, usq}\) Bayes classifiers are likewise not cost sensitive and noise robust. We show that regularized ERM of this loss function over the class of linear classifiers yields a cost sensitive, uniform noise robust classifier as the solution of a system of linear equations. We also provide a performance bound for this classifier. The second scheme is a re-sampling based scheme that exploits the special structure of the uniform noise model and uses in-class probability estimates. Our computational experiments on some UCI datasets with class imbalance show that the classifiers from our two schemes are on par with the existing methods, and in some cases better, with respect to Accuracy and Arithmetic Mean, without using or tuning the noise rate. We also consider other cost sensitive performance measures, namely F measure and Weighted Cost, for evaluation.
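To make the "solution of a system of linear equations" remark concrete, here is a minimal sketch of regularized ERM for a weighted, uneven-margin squared loss over linear classifiers. The abstract does not state the form of \(l_{\alpha, usq}\); the form used below, \((1-\alpha)(1-f)^2\) for positive examples and \(\alpha \gamma^{-1}(1+\gamma f)^2\) for negative examples, is an assumption, as are the function names, the regularized bias term, and the toy data. It should not be read as the authors' implementation.

```python
def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are not modified.
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def fit_uneven_squared(X, y, alpha, gamma, lam):
    """Regularized ERM for the ASSUMED loss (not given in the abstract):
         y = +1: (1 - alpha) * (1 - f)^2
         y = -1: alpha * (1 / gamma) * (1 + gamma * f)^2
    with f(x) = w . [x, 1] and penalty lam * ||w||^2 (bias included).
    Setting the gradient to zero gives the linear system A w = b."""
    m = len(X)
    Z = [list(x) + [1.0] for x in X]  # constant feature for the bias
    d = len(Z[0])
    A = [[(m * lam if i == j else 0.0) for j in range(d)] for i in range(d)]
    b = [0.0] * d
    for z, label in zip(Z, y):
        quad = (1 - alpha) if label == 1 else alpha * gamma  # weight on z z^T
        lin = (1 - alpha) if label == 1 else -alpha          # weight on z
        for i in range(d):
            b[i] += lin * z[i]
            for j in range(d):
                A[i][j] += quad * z[i] * z[j]
    return solve_linear_system(A, b)


def gradient(w, X, y, alpha, gamma, lam):
    """Gradient of the regularized empirical risk, used as a sanity check."""
    m = len(X)
    g = [2 * lam * wi for wi in w]
    for x, label in zip(X, y):
        z = list(x) + [1.0]
        f = sum(wi * zi for wi, zi in zip(w, z))
        if label == 1:
            coef = -2 * (1 - alpha) * (1 - f)
        else:
            coef = 2 * alpha * (1 + gamma * f)
        for i in range(len(w)):
            g[i] += coef * z[i] / m
    return g


# Hypothetical toy data: two positive and three negative examples.
X = [[2.0, 1.0], [1.5, 2.0], [-1.0, -0.5], [-2.0, -1.5], [-1.5, 0.5]]
y = [1, 1, -1, -1, -1]
w = fit_uneven_squared(X, y, alpha=0.3, gamma=0.5, lam=0.1)
```

Because the assumed loss is quadratic in the weights, the minimizer needs no iterative optimizer, and the noise rate does not appear anywhere in the fit. Evaluating `gradient(w, X, y, 0.3, 0.5, 0.1)` should return values close to zero, which confirms that `w` is the stationary point of the objective as written.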

Metadata

Title: Cost Sensitive Learning in the Presence of Symmetric Label Noise
Authors: Sandhya Tripathi, Nandyala Hemachandra
Copyright Year: 2019
DOI: https://doi.org/10.1007/978-3-030-16148-4_2