Skip to main content

2018 | OriginalPaper | Buchkapitel

Semi-supervised Fuzzy c-Means Variants: A Study on Noisy Label Supervision

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Semi-supervised clustering algorithms aim at discovering the hidden structure of data sets with the help of expert knowledge, generally expressed as constraints on the data such as class labels or pairwise relations. Most of the time, the expert is considered as an oracle that only provides correct constraints. This paper focuses on the case where some label constraints are erroneous and proposes to investigate into more detail three semi-supervised fuzzy c-means clustering approaches as they have been tailored to naturally handle uncertainty in the expert labeling. In order to run a fair comparison between existing algorithms, formal improvements have been proposed to guarantee and fasten their convergence. Experiments conducted on real and synthetical datasets under uncertain labels and noise in the constraints show the effectiveness of using fuzzy clustering algorithm for noisy semi-supervised clustering.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Basu, S., Davidson, I., Wagstaff, K.: Constrained Clustering: Advances in Algorithms, Theory, and Applications. Chapman & Hall/CRC, Boca Raton (2008)MATH Basu, S., Davidson, I., Wagstaff, K.: Constrained Clustering: Advances in Algorithms, Theory, and Applications. Chapman & Hall/CRC, Boca Raton (2008)MATH
2.
Zurück zum Zitat Bouchachia, A., Pedrycz, W.: Enhancement of fuzzy clustering by mechanisms of partial supervision. Fuzzy Sets Syst. 157(13), 1733–1759 (2006)MathSciNetCrossRef Bouchachia, A., Pedrycz, W.: Enhancement of fuzzy clustering by mechanisms of partial supervision. Fuzzy Sets Syst. 157(13), 1733–1759 (2006)MathSciNetCrossRef
3.
Zurück zum Zitat Antoine, V., Quost, B., Masson, M.H., Denœux, T.: Evidential clustering with instance-level constraints for proximity data. Soft. Comput. 18(7), 1321–1335 (2014)CrossRef Antoine, V., Quost, B., Masson, M.H., Denœux, T.: Evidential clustering with instance-level constraints for proximity data. Soft. Comput. 18(7), 1321–1335 (2014)CrossRef
4.
Zurück zum Zitat Basu, S., Banerjee, A., Mooney, R.: Active semi-supervision for pairwise constrained clustering. In: Proceedings of 2004 SIAM Interernational Conference on Data Mining, pp. 333–344 (2004)CrossRef Basu, S., Banerjee, A., Mooney, R.: Active semi-supervision for pairwise constrained clustering. In: Proceedings of 2004 SIAM Interernational Conference on Data Mining, pp. 333–344 (2004)CrossRef
5.
Zurück zum Zitat Bilenko, M., Basu, S., Mooney, R.J.: Integrating constraints and metric learning in semi-supervised clustering. In: Proceedings of 21st ICML (2004) Bilenko, M., Basu, S., Mooney, R.J.: Integrating constraints and metric learning in semi-supervised clustering. In: Proceedings of 21st ICML (2004)
6.
Zurück zum Zitat Wagstaff, K.L.: When is constrained clustering beneficial, and why. In: AAAI (2006) Wagstaff, K.L.: When is constrained clustering beneficial, and why. In: AAAI (2006)
7.
Zurück zum Zitat Vu, V., Labroche, N., Bouchon-Meunier, B.: Boosting clustering by active constraint selection. In: Proceedings of 2010 19th ECAI, pp. 297–302 (2010) Vu, V., Labroche, N., Bouchon-Meunier, B.: Boosting clustering by active constraint selection. In: Proceedings of 2010 19th ECAI, pp. 297–302 (2010)
8.
Zurück zum Zitat Vu, V., Labroche, N., Bouchon-Meunier, B.: An efficient active constraint selection algorithm for clustering. In: 20th ICPR, pp. 2969–2972 (2010) Vu, V., Labroche, N., Bouchon-Meunier, B.: An efficient active constraint selection algorithm for clustering. In: 20th ICPR, pp. 2969–2972 (2010)
10.
Zurück zum Zitat Pedrycz, W., Waletzky, J.: Fuzzy clustering with partial supervision. IEEE Trans. Syst. Man Cybern. Part B Cybern. 27(5), 787–795 (1997)CrossRef Pedrycz, W., Waletzky, J.: Fuzzy clustering with partial supervision. IEEE Trans. Syst. Man Cybern. Part B Cybern. 27(5), 787–795 (1997)CrossRef
11.
Zurück zum Zitat Lai, D., Garibaldi, J.: A comparison of distance-based semi-supervised fuzzy c-means clustering algorithms. In: IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1580–1586 (2011) Lai, D., Garibaldi, J.: A comparison of distance-based semi-supervised fuzzy c-means clustering algorithms. In: IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1580–1586 (2011)
13.
Zurück zum Zitat Gustafson, D., Kessel, W.: Fuzzy clustering with a fuzzy covariance matrix. In: IEEE Conference on Decision and Control Including the 17th Symposium on Adaptive Processes, pp. 761–766 (1979) Gustafson, D., Kessel, W.: Fuzzy clustering with a fuzzy covariance matrix. In: IEEE Conference on Decision and Control Including the 17th Symposium on Adaptive Processes, pp. 761–766 (1979)
14.
Zurück zum Zitat Endo, Y., Hamasuna, Y., Yamashiro, M., Miyamoto, S.: On semi-supervised fuzzy c-means clustering. In: IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1119–1124 (2009) Endo, Y., Hamasuna, Y., Yamashiro, M., Miyamoto, S.: On semi-supervised fuzzy c-means clustering. In: IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1119–1124 (2009)
15.
Zurück zum Zitat Basu, S., Banerjee, A., Mooney, R.: Semi-supervised clustering by seeding. In: Proceedings of 19th International Conference on Machine Learning (ICML), pp. 27–34 (2002) Basu, S., Banerjee, A., Mooney, R.: Semi-supervised clustering by seeding. In: Proceedings of 19th International Conference on Machine Learning (ICML), pp. 27–34 (2002)
16.
Zurück zum Zitat Basu, S., Bilenko, M., Banerjee, A., Mooney, R.: Probabilistic Semi-supervised Clustering with Constraints, pp. 71–98. MIT Press, Cambridge (2006) Basu, S., Bilenko, M., Banerjee, A., Mooney, R.: Probabilistic Semi-supervised Clustering with Constraints, pp. 71–98. MIT Press, Cambridge (2006)
17.
Zurück zum Zitat Rand, W.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66(336), 846–850 (1971)CrossRef Rand, W.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66(336), 846–850 (1971)CrossRef
18.
Zurück zum Zitat Dave, R.: Validating fuzzy partitions obtained through c-shells clustering. Pattern Recogn. Lett. 17(6), 613–623 (1996)CrossRef Dave, R.: Validating fuzzy partitions obtained through c-shells clustering. Pattern Recogn. Lett. 17(6), 613–623 (1996)CrossRef
Metadaten
Titel
Semi-supervised Fuzzy c-Means Variants: A Study on Noisy Label Supervision
verfasst von
Violaine Antoine
Nicolas Labroche
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-91476-3_5