Skip to main content
Top
Published in: Soft Computing 7/2014

01-07-2014 | Methodologies and Application

CEVCLUS: evidential clustering with instance-level constraints for relational data

Authors: V. Antoine, B. Quost, M.-H. Masson, T. Denoeux

Published in: Soft Computing | Issue 7/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recent advances in clustering consider incorporating background knowledge in the partitioning algorithm, using, e.g., pairwise constraints between objects. As a matter of fact, prior information, when available, often makes it possible to better retrieve meaningful clusters in data. Here, this approach is investigated in the framework of belief functions, which allows us to handle the imprecision and the uncertainty of the clustering process. In this context, the EVCLUS algorithm was proposed for partitioning objects described by a dissimilarity matrix. It is extended here so as to take pairwise constraints into account, by adding a term to its objective function. This term corresponds to a penalty term that expresses pairwise constraints in the belief function framework. Various synthetic and real datasets are considered to demonstrate the interest of the proposed method, called CEVCLUS, and two applications are presented. The performances of CEVCLUS are also compared to those of other constrained clustering algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Antoine V, Quost B, Masson M-H, Denœux T (2012) CECM: constrained evidential C-means algorithm. Computat Stat Data Anal 56:894–914CrossRefMATH Antoine V, Quost B, Masson M-H, Denœux T (2012) CECM: constrained evidential C-means algorithm. Computat Stat Data Anal 56:894–914CrossRefMATH
go back to reference Basu S, Bilenko M, Banerjee A, Mooney R (2006) Probabilistic semi-supervised clustering with constraints. In: Chapelle O, Schölkopf B, Zien A (eds) Semi-supervised learning. MIT Press, Cambridge, pp 71–98 Basu S, Bilenko M, Banerjee A, Mooney R (2006) Probabilistic semi-supervised clustering with constraints. In: Chapelle O, Schölkopf B, Zien A (eds) Semi-supervised learning. MIT Press, Cambridge, pp 71–98
go back to reference Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Kluwer, NorwellCrossRefMATH Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Kluwer, NorwellCrossRefMATH
go back to reference Bunke H, Bühler U (1993) Applications of approximate string matching to 2d shape recognition. Pattern Recogn 26(12):1797–1812CrossRef Bunke H, Bühler U (1993) Applications of approximate string matching to 2d shape recognition. Pattern Recogn 26(12):1797–1812CrossRef
go back to reference Davidson I, Ravi S (2005) Agglomerative hierarchical clustering with constraints: theoretical and empirical results. Proceedings of the 9th European conference on principles and practice of knowledge discovery in databases (PKDD), vol 3721. Springer, Porto, pp 59–70 Davidson I, Ravi S (2005) Agglomerative hierarchical clustering with constraints: theoretical and empirical results. Proceedings of the 9th European conference on principles and practice of knowledge discovery in databases (PKDD), vol 3721. Springer, Porto, pp 59–70
go back to reference Davidson I, Wagstaff K, Basu S (2006) Measuring constraint-set utility for partitional clustering algorithms. In: Proceedings of the 10th European conference on principles and practice of knowledge discovery in databases (PKDD). Berlin, Germany, vol 4213, pp 115–126 Davidson I, Wagstaff K, Basu S (2006) Measuring constraint-set utility for partitional clustering algorithms. In: Proceedings of the 10th European conference on principles and practice of knowledge discovery in databases (PKDD). Berlin, Germany, vol 4213, pp 115–126
go back to reference Denœux T, Masson M-H (2004) EVCLUS: evidential clustering of proximity data. IEEE Trans Syst Man Cybern B 34(1):95–109 Denœux T, Masson M-H (2004) EVCLUS: evidential clustering of proximity data. IEEE Trans Syst Man Cybern B 34(1):95–109
go back to reference Everitt BS, Landau S, Leese M (2009) Hierarchical clustering. Cluster analysis, 4th edn. Wiley, New York, pp 55–89 Everitt BS, Landau S, Leese M (2009) Hierarchical clustering. Cluster analysis, 4th edn. Wiley, New York, pp 55–89
go back to reference Frigui H, Hwang C (2007) Adaptive concept learning through clustering and aggregation of relational data. Proceedings of the 7th SIAM international conference on data mining, Minneaplis, USA, pp 90–101 Frigui H, Hwang C (2007) Adaptive concept learning through clustering and aggregation of relational data. Proceedings of the 7th SIAM international conference on data mining, Minneaplis, USA, pp 90–101
go back to reference Gondek D, Hofmann T (2007) Non-redundant data clustering. Knowl Inf Syst 12(1):1–24 Gondek D, Hofmann T (2007) Non-redundant data clustering. Knowl Inf Syst 12(1):1–24
go back to reference Grira N, Crucianu M, Boujemaa N (2008) Active semi-supervised fuzzy clustering. Pattern Recogn 41(5):1834–1844CrossRefMATH Grira N, Crucianu M, Boujemaa N (2008) Active semi-supervised fuzzy clustering. Pattern Recogn 41(5):1834–1844CrossRefMATH
go back to reference Hamasuna Y, Endo Y (2012) On semi-supervised fuzzy c-means clustering for data with clusterwise tolerance by opposite criteria. Soft Comput Fusion Found Methodol Appl 1–11 Hamasuna Y, Endo Y (2012) On semi-supervised fuzzy c-means clustering for data with clusterwise tolerance by opposite criteria. Soft Comput Fusion Found Methodol Appl 1–11
go back to reference Kannan S, Sathya A, Ramathilagam S (2011) Effective fuzzy clustering techniques for segmentation of breast mri. Soft Comput Fusion Found Methodol Appl 15:483–491 Kannan S, Sathya A, Ramathilagam S (2011) Effective fuzzy clustering techniques for segmentation of breast mri. Soft Comput Fusion Found Methodol Appl 15:483–491
go back to reference Klein D, Kamvar S, Manning C (2002) From instance-level constraints to space-level constraints: making the most of prior knowledge in data clustering. In: Proceedings of the 19th international conference on machine learning. Australia, Sydney, pp 307–314 Klein D, Kamvar S, Manning C (2002) From instance-level constraints to space-level constraints: making the most of prior knowledge in data clustering. In: Proceedings of the 19th international conference on machine learning. Australia, Sydney, pp 307–314
go back to reference Klir G, Wierman M (1999) Uncertainty-based information: elements of generalized information theory. Springer, New YorkCrossRefMATH Klir G, Wierman M (1999) Uncertainty-based information: elements of generalized information theory. Springer, New YorkCrossRefMATH
go back to reference Kulis B, Basu S, Dhillon I, Mooney R (2005) Semi-supervised graph clustering: a kernel approach. In: 22nd international conference on machine learning (ICML). Bonn, Germany, pp 457–464 Kulis B, Basu S, Dhillon I, Mooney R (2005) Semi-supervised graph clustering: a kernel approach. In: 22nd international conference on machine learning (ICML). Bonn, Germany, pp 457–464
go back to reference Law M, Topchy A, Jain A (2004) Clustering with soft and group constraints. Structural, syntactic, and statistical pattern recognition, Joint IAPR international workshops, SSPR 2004 and SPR 2004, vol 3138. Springer, Lisbon, pp 662–670 Law M, Topchy A, Jain A (2004) Clustering with soft and group constraints. Structural, syntactic, and statistical pattern recognition, Joint IAPR international workshops, SSPR 2004 and SPR 2004, vol 3138. Springer, Lisbon, pp 662–670
go back to reference Lazzerini B, Marcelloni F (2007) A hierarchical fuzzy clustering-based system to create user profiles. Soft Comput Fusion Found Methodol Appl 11:157–168 Lazzerini B, Marcelloni F (2007) A hierarchical fuzzy clustering-based system to create user profiles. Soft Comput Fusion Found Methodol Appl 11:157–168
go back to reference Li YL, Shen Y (2010) An automatic fuzzy c-means algorithm for image segmentation. Soft Comput Fusion Found Methodol Appl 14:123–128 Li YL, Shen Y (2010) An automatic fuzzy c-means algorithm for image segmentation. Soft Comput Fusion Found Methodol Appl 14:123–128
go back to reference Liu Y, Jin R, Jain A (2007) Boostcluster: boosting clustering by pairwise constraints. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, San Jose, pp 450–459 Liu Y, Jin R, Jain A (2007) Boostcluster: boosting clustering by pairwise constraints. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, San Jose, pp 450–459
go back to reference Masson M-H, Denœux T (2004) Clustering interval-valued data using belief functions. Pattern Recogn Lett 25(2):163–171CrossRef Masson M-H, Denœux T (2004) Clustering interval-valued data using belief functions. Pattern Recogn Lett 25(2):163–171CrossRef
go back to reference Masson M-H, Denœux T (2008) ECM: an evidential version of the fuzzy c-means algorithm. Pattern Recogn 41(4):1384–1397CrossRefMATH Masson M-H, Denœux T (2008) ECM: an evidential version of the fuzzy c-means algorithm. Pattern Recogn 41(4):1384–1397CrossRefMATH
go back to reference Masson M-H, Denœux T (2009) RECM: relational evidential c-means algorithm. Pattern Recogn Lett 30(11):1015–1026CrossRef Masson M-H, Denœux T (2009) RECM: relational evidential c-means algorithm. Pattern Recogn Lett 30(11):1015–1026CrossRef
go back to reference Pal N, Bezdek J, Hemasinha R (1992) Uncertainty measures for evidential reasoning i: a review. Int J Approx Reason 7(3–4):165–183 Pal N, Bezdek J, Hemasinha R (1992) Uncertainty measures for evidential reasoning i: a review. Int J Approx Reason 7(3–4):165–183
go back to reference Pedrycz W (2007) Collaborative and knowledge-based fuzzy clustering. Int J Innov Comput Inf Control 1(3):1–12MathSciNet Pedrycz W (2007) Collaborative and knowledge-based fuzzy clustering. Int J Innov Comput Inf Control 1(3):1–12MathSciNet
go back to reference Pekalska E, Duin R (2005) The dissimilarity representation for pattern recognition, vol 64, foundations and applications. World Scientific, Singapore Pekalska E, Duin R (2005) The dissimilarity representation for pattern recognition, vol 64, foundations and applications. World Scientific, Singapore
go back to reference Sammon JW (1969) A nonlinear mapping for data structure analysis. IEEE Trans Comput 18:401–409CrossRef Sammon JW (1969) A nonlinear mapping for data structure analysis. IEEE Trans Comput 18:401–409CrossRef
go back to reference Shafer G (1976) A mathematical theory of evidence. Princeton University Press, PrincetonMATH Shafer G (1976) A mathematical theory of evidence. Princeton University Press, PrincetonMATH
go back to reference Smets P (1990) The combination of evidence in the transferable belief model. IEEE Trans Pattern Anal Mach Intell 12(5):447–458CrossRef Smets P (1990) The combination of evidence in the transferable belief model. IEEE Trans Pattern Anal Mach Intell 12(5):447–458CrossRef
go back to reference Wagstaff K (2007) Value, cost, and sharing: open issues in constrained clustering. Knowl Discov Induct Databases 4747:1–10CrossRef Wagstaff K (2007) Value, cost, and sharing: open issues in constrained clustering. Knowl Discov Induct Databases 4747:1–10CrossRef
go back to reference Wagstaff K, Cardie C, Rogers S, Schroedl S (2001) Constrained k-means clustering with background knowledge. In: Proceedings of the 18th international conference on machine learning. Williamstown, MA, USA, pp 577–584 Wagstaff K, Cardie C, Rogers S, Schroedl S (2001) Constrained k-means clustering with background knowledge. In: Proceedings of the 18th international conference on machine learning. Williamstown, MA, USA, pp 577–584
go back to reference Xing E, Ng A, Jordan M, Russell S (2002) Distance metric learning with application to clustering with side-information. In: Proceedings of the 15th annual conference on advances in neural information processing systems (NIPS). Vancouver, British Columbia, Canada, pp 521–528 Xing E, Ng A, Jordan M, Russell S (2002) Distance metric learning with application to clustering with side-information. In: Proceedings of the 15th annual conference on advances in neural information processing systems (NIPS). Vancouver, British Columbia, Canada, pp 521–528
go back to reference Zhenguo L, Jianzhuang L, Xiaoou T (2009) Constrained clustering via spectral regularization. In: IEEE conference on computer vision and pattern recognition (CVPR). Miami, FL, USA, pp 421–428 Zhenguo L, Jianzhuang L, Xiaoou T (2009) Constrained clustering via spectral regularization. In: IEEE conference on computer vision and pattern recognition (CVPR). Miami, FL, USA, pp 421–428
go back to reference Zhong S, Ghosh J (2003) A unified framework for model-based clustering. J Mach Learn Res 4:1001–1037MathSciNet Zhong S, Ghosh J (2003) A unified framework for model-based clustering. J Mach Learn Res 4:1001–1037MathSciNet
Metadata
Title
CEVCLUS: evidential clustering with instance-level constraints for relational data
Authors
V. Antoine
B. Quost
M.-H. Masson
T. Denoeux
Publication date
01-07-2014
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 7/2014
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-013-1146-z

Other articles of this Issue 7/2014

Soft Computing 7/2014 Go to the issue

Premium Partner