Skip to main content
Top
Published in: Soft Computing 16/2019

02-01-2019 | Foundations

A study of similarity measures through the paradigm of measurement theory: the classic case

Published in: Soft Computing | Issue 16/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Similarity measures are used in various tasks dealing with the management of data or information, such as decision-making, case-based reasoning, cased-based information retrieval, recommendation systems and user profile analysis, to cite but a few. The paper aims at providing information on similarity measures that can help in choosing “a priori” one of them on the basis of the semantics behind this choice. To this end, we study similarity measures from the point of view of the ranking relation they induce on object pairs. Using a classic method of measurement theory, we establish necessary and sufficient conditions for the existence of a particular class of numerical similarity measures, representing a given binary relation among pairs of objects which express the idea of “no more similar than”. The above conditions are all (and only) the rules which are accepted when one decides to evaluate similarity through any element of a specific class of similarity measures. We exemplify the possible application of such conditions and the relevant results on a real-world problem and discuss them in the ambit of cognitive psychology. We consider here a classical context, while the fuzzy context will be studied in a companion paper.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Anderberg MR (1973) Cluster analysis for applications. Academic Press, New YorkMATH Anderberg MR (1973) Cluster analysis for applications. Academic Press, New YorkMATH
go back to reference Baioletti M, Coletti G, Petturiti D (2012) Advances in computational intelligence: 14th international conference on information processing and management of uncertainty in knowledge-based systems, IPMU 2012, Catania, Italy, July 9–13, 2012, Proceedings, Part III, Chapter. Weighted attribute combinations based similarity measures. Springer, Berlin, pp 211–220 Baioletti M, Coletti G, Petturiti D (2012) Advances in computational intelligence: 14th international conference on information processing and management of uncertainty in knowledge-based systems, IPMU 2012, Catania, Italy, July 9–13, 2012, Proceedings, Part III, Chapter. Weighted attribute combinations based similarity measures. Springer, Berlin, pp 211–220
go back to reference Bertoluzza C, Di Bacco M, Doldi V (2004) An axiomatic characterization of the measures of similarity. Sankhya 66:474–486MathSciNetMATH Bertoluzza C, Di Bacco M, Doldi V (2004) An axiomatic characterization of the measures of similarity. Sankhya 66:474–486MathSciNetMATH
go back to reference Boriah S, Chandola V, Kumar V (2008) Similarity measures for categorical data: a comparative evaluation. In: Proceedings of the 8th SIAM international conference on data mining, SIAM, pp 243–254 Boriah S, Chandola V, Kumar V (2008) Similarity measures for categorical data: a comparative evaluation. In: Proceedings of the 8th SIAM international conference on data mining, SIAM, pp 243–254
go back to reference Bouchon-Meunier B, Rifqi M, Lesot MJ (2008) Similarities in fuzzy data mining: from a cognitive view to real-world applications. In Zurada J, Yen G, Wang J (eds) Computational intelligence: research frontiers. WCCI 2008, vol 5050. Springer, LNCS, pp 349–367 Bouchon-Meunier B, Rifqi M, Lesot MJ (2008) Similarities in fuzzy data mining: from a cognitive view to real-world applications. In Zurada J, Yen G, Wang J (eds) Computational intelligence: research frontiers. WCCI 2008, vol 5050. Springer, LNCS, pp 349–367
go back to reference Bouchon-Meunier B, Coletti G, Lesot MJ, Rifqi M (2009) Towards a conscious choice of a similarity measure: a qualitative point of view. In: Sossai C, Ghemello G (eds) Symbolic and quantitative approaches to reasoning with uncertainty: Ecsqaru 2009 proceedings, vol 5590. Springer, LNAI, pp 542–553 Bouchon-Meunier B, Coletti G, Lesot MJ, Rifqi M (2009) Towards a conscious choice of a similarity measure: a qualitative point of view. In: Sossai C, Ghemello G (eds) Symbolic and quantitative approaches to reasoning with uncertainty: Ecsqaru 2009 proceedings, vol 5590. Springer, LNAI, pp 542–553
go back to reference Bouchon-Meunier B, Coletti G, Lesot MJ, Rifqi M (2010) Towards a conscious choice of a fuzzy similarity measure: a qualitative point of view. In: Hllermeier E, Kruse R, Hoffmann F (eds) Computational intelligence for knowledge-based system design: IPMU 2010 proceedings, vol 6178. Springer, LNAI, pp 1–10 Bouchon-Meunier B, Coletti G, Lesot MJ, Rifqi M (2010) Towards a conscious choice of a fuzzy similarity measure: a qualitative point of view. In: Hllermeier E, Kruse R, Hoffmann F (eds) Computational intelligence for knowledge-based system design: IPMU 2010 proceedings, vol 6178. Springer, LNAI, pp 1–10
go back to reference Choi S-S, Cha S-H, Tappert CC (2010) A survey of binary similarity and distance measures. J Syst Cybern Inf 8(1):43–48 Choi S-S, Cha S-H, Tappert CC (2010) A survey of binary similarity and distance measures. J Syst Cybern Inf 8(1):43–48
go back to reference Coletti G, Bouchon-Meunier B (2018) A study of similarity measures through the paradigm of measurement theory: the fuzzy case. SoftComputing (submitted) Coletti G, Bouchon-Meunier B (2018) A study of similarity measures through the paradigm of measurement theory: the fuzzy case. SoftComputing (submitted)
go back to reference Coletti G, Di Bacco M (1989) Qualitative characterization of a dissimilarity and concentration index. Metron XLVII:121–130MathSciNetMATH Coletti G, Di Bacco M (1989) Qualitative characterization of a dissimilarity and concentration index. Metron XLVII:121–130MathSciNetMATH
go back to reference Coletti G, Petturiti D, Vantaggi B (2017) Fuzzy weighted attribute combinations based similarity measures. In: Proceedings of ECSQARU 2017 (Symbolic and quantitative approaches to reasoning with uncertainty), vol 10369. LNCS, pp 364–374 Coletti G, Petturiti D, Vantaggi B (2017) Fuzzy weighted attribute combinations based similarity measures. In: Proceedings of ECSQARU 2017 (Symbolic and quantitative approaches to reasoning with uncertainty), vol 10369. LNCS, pp 364–374
go back to reference Couso I, Garrido L, Sànchez L (2013) Similarity and dissimilarity measures between fuzzy sets: a formal relational study. Inf Sci 229:122–141MathSciNetCrossRefMATH Couso I, Garrido L, Sànchez L (2013) Similarity and dissimilarity measures between fuzzy sets: a formal relational study. Inf Sci 229:122–141MathSciNetCrossRefMATH
go back to reference Cross VV, Sudkamp TA (2002) Similarity and compatibility in fuzzy set theory: assessment and applications. Studies in fuzziness and soft computing, vol 93. Springer, BerlinCrossRefMATH Cross VV, Sudkamp TA (2002) Similarity and compatibility in fuzzy set theory: assessment and applications. Studies in fuzziness and soft computing, vol 93. Springer, BerlinCrossRefMATH
go back to reference Dice LR (1945) Measures of the amount of ecological association between species. Ecology 26:297–302CrossRef Dice LR (1945) Measures of the amount of ecological association between species. Ecology 26:297–302CrossRef
go back to reference Dvoraki J, Baume N, Botré Broséus J, Budgett R, Frey WO, Geyer H, Harcourt PR, Ho D, Howman D, Isola V, Lundby C, Marclay F, Peytavin A, Pipe A, Pitsiladis YP, Reichel C, Robinson N, Rodchenkov G, Saugy M, Sayegh S, Segura J, Thevis M, Vernec A, Viret M, Vouillamoz M, Zorzoli M (2014) Time for change: a roadmap to guide the implementation of the World Anti-Doping Code 2015. Br J Sports Med: BJSM 48:801–806CrossRef Dvoraki J, Baume N, Botré Broséus J, Budgett R, Frey WO, Geyer H, Harcourt PR, Ho D, Howman D, Isola V, Lundby C, Marclay F, Peytavin A, Pipe A, Pitsiladis YP, Reichel C, Robinson N, Rodchenkov G, Saugy M, Sayegh S, Segura J, Thevis M, Vernec A, Viret M, Vouillamoz M, Zorzoli M (2014) Time for change: a roadmap to guide the implementation of the World Anti-Doping Code 2015. Br J Sports Med: BJSM 48:801–806CrossRef
go back to reference Filev P, Hadjiiski L, Sahiner B, Chan HP, Helvie MA (2005) Comparison of similarity measures for the task of template matching of masses on serial mammograms. Med Phys 32(2):515–529CrossRef Filev P, Hadjiiski L, Sahiner B, Chan HP, Helvie MA (2005) Comparison of similarity measures for the task of template matching of masses on serial mammograms. Med Phys 32(2):515–529CrossRef
go back to reference Gilboa I, Lieberman O, Schmeidler D (2006) A similarity-based approach to prediction. Rev Econ Stat 162(1):124–131MathSciNetMATH Gilboa I, Lieberman O, Schmeidler D (2006) A similarity-based approach to prediction. Rev Econ Stat 162(1):124–131MathSciNetMATH
go back to reference Ha V, Haddawy P (2003) Similarity of personal preferences: theoretical foundations and empirical analysis. Artif Intell 146:149–173MathSciNetCrossRefMATH Ha V, Haddawy P (2003) Similarity of personal preferences: theoretical foundations and empirical analysis. Artif Intell 146:149–173MathSciNetCrossRefMATH
go back to reference Hahn U, Ramscar M (eds) (2001) Similarity and categorization. Oxford University Press, Oxford Hahn U, Ramscar M (eds) (2001) Similarity and categorization. Oxford University Press, Oxford
go back to reference Hwang CM, Yang MS, Hung WL, Lee MG (2012) A similarity measure of intuitionistic fuzzy sets based on the Sugeno integral with its application to pattern recognition. Inf Sci 189:93–109MathSciNetCrossRefMATH Hwang CM, Yang MS, Hung WL, Lee MG (2012) A similarity measure of intuitionistic fuzzy sets based on the Sugeno integral with its application to pattern recognition. Inf Sci 189:93–109MathSciNetCrossRefMATH
go back to reference Jaccard P (1908) Nouvelles recherches sur la distribution florale. Bull Soc Vaud Sci Nat 44:223–270 Jaccard P (1908) Nouvelles recherches sur la distribution florale. Bull Soc Vaud Sci Nat 44:223–270
go back to reference Krantz D, Luce R, Suppes P, Tversky A (1971) Foundations of measurement, vol I. Academic Press, New YorkMATH Krantz D, Luce R, Suppes P, Tversky A (1971) Foundations of measurement, vol I. Academic Press, New YorkMATH
go back to reference Lesot MJ, Rifqi M (2010) Order-based equivalence degrees for similarity and distance measures. In: Hllermeier E, Kruse R, Hoffmann F (eds) Computational intelligence for knowledge-based systems design. IPMU 2010, vol 6178. Lecture Notes in Computer Science, Springer, Berlin, Heidelberg, pp 19–28 Lesot MJ, Rifqi M (2010) Order-based equivalence degrees for similarity and distance measures. In: Hllermeier E, Kruse R, Hoffmann F (eds) Computational intelligence for knowledge-based systems design. IPMU 2010, vol 6178. Lecture Notes in Computer Science, Springer, Berlin, Heidelberg, pp 19–28
go back to reference Lesot MJ, Rifqi M, Benhadda H (2009) Similarity measures for binary and numerical data: a survey. Int J Knowl Eng Soft Data Paradig (KESDP) 1:63–84CrossRef Lesot MJ, Rifqi M, Benhadda H (2009) Similarity measures for binary and numerical data: a survey. Int J Knowl Eng Soft Data Paradig (KESDP) 1:63–84CrossRef
go back to reference Ochiai A (1957) Zoogeographic studies on the soleoid fishes found in Japan and its neighbouring regions. Bull Jpn Soc Sci Fish 22:526–30CrossRef Ochiai A (1957) Zoogeographic studies on the soleoid fishes found in Japan and its neighbouring regions. Bull Jpn Soc Sci Fish 22:526–30CrossRef
go back to reference Pelillo M (ed) (2013) Similarity-based pattern analysis and recognition. Advances in computer vision and pattern recognition. Springer, LondonMATH Pelillo M (ed) (2013) Similarity-based pattern analysis and recognition. Advances in computer vision and pattern recognition. Springer, LondonMATH
go back to reference Penney GP, Weese J, Little JA, Desmedt P, Hill DLG, Hawkes DJ (1998) A comparison of similarity measures for use in 2-D-3-D medical image registration. In: Proceedings of MICCAI 1998: medical image computing and computer-assisted intervention MICCAI98, vol. 1496. LNCS, pp 1153–1161 Penney GP, Weese J, Little JA, Desmedt P, Hill DLG, Hawkes DJ (1998) A comparison of similarity measures for use in 2-D-3-D medical image registration. In: Proceedings of MICCAI 1998: medical image computing and computer-assisted intervention MICCAI98, vol. 1496. LNCS, pp 1153–1161
go back to reference Rogers DJ, Tanimoto TT (1960) A computer program for classifying plants. Science 132:1115–1118CrossRef Rogers DJ, Tanimoto TT (1960) A computer program for classifying plants. Science 132:1115–1118CrossRef
go back to reference Sokal RR, Michener C (1958) A statistical method for evaluating systematic relationships. Univ Kansas Sci Bull 38:1409–1438 Sokal RR, Michener C (1958) A statistical method for evaluating systematic relationships. Univ Kansas Sci Bull 38:1409–1438
go back to reference Sokal RR, Sneath PHA (1963) Priciples of numerical taxonomy. W.H. Freeman, San Francisco Sokal RR, Sneath PHA (1963) Priciples of numerical taxonomy. W.H. Freeman, San Francisco
go back to reference Sorensen T (1948) A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. K Dan Vidensk Selsk Biol Skr 5:1–34 Sorensen T (1948) A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. K Dan Vidensk Selsk Biol Skr 5:1–34
go back to reference Simmons S, Estes Z (2008) Individual differences in the perception of similarity and difference. Cognition 106(3):781–795CrossRef Simmons S, Estes Z (2008) Individual differences in the perception of similarity and difference. Cognition 106(3):781–795CrossRef
go back to reference Suppes P, Krantz D, Luce R, Tversky A (1989) Foundations of measurement, vol II. Academic Press, New YorkMATH Suppes P, Krantz D, Luce R, Tversky A (1989) Foundations of measurement, vol II. Academic Press, New YorkMATH
go back to reference Toussaint GT (2004) A comparison of rhythmic similarity measures. In: Proceedings 5th international conference on music information retrieval Toussaint GT (2004) A comparison of rhythmic similarity measures. In: Proceedings 5th international conference on music information retrieval
Metadata
Title
A study of similarity measures through the paradigm of measurement theory: the classic case
Publication date
02-01-2019
Published in
Soft Computing / Issue 16/2019
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-018-03724-3

Other articles of this Issue 16/2019

Soft Computing 16/2019 Go to the issue

Premium Partner