Skip to main content

2019 | OriginalPaper | Buchkapitel

Statistical Archetypal Analysis for Cognitive Categorization

verfasst von : Francesco Santelli, Francesco Palumbo, Giancarlo Ragozini

Erschienen in: New Statistical Developments in Data Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Human knowledge develops through complex relationships between categories. In the era of Big Data, the concept of categorization implies data summarization in a limited number of well-separated groups that must be maximally and internally homogeneous at the same time. This proposal exploits archetypal analysis capabilities by finding a set of extreme points that can summarize entire data sets in homogeneous groups. The archetypes are then used to identify the best prototypes according to Rosch’s definition. Finally, in the geometric approach to cognitive science, the Voronoi tessellation based on the prototypes is used to define categorization. An example using a well-known wine dataset by Forina et al. illustrates the procedure.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Adams, B., Raubal, M.: A metric conceptual space algebra. In: Spatial Information Theory. Springer (2009) Adams, B., Raubal, M.: A metric conceptual space algebra. In: Spatial Information Theory. Springer (2009)
2.
Zurück zum Zitat Aitchison, J., Barceló-Vidal, C., Martín-Fernández, J., Pawlowsky-Glahn, V.: Logratio analysis and compositional distance. Math. Geol. 32(3), 271–275 (2000)MATHCrossRef Aitchison, J., Barceló-Vidal, C., Martín-Fernández, J., Pawlowsky-Glahn, V.: Logratio analysis and compositional distance. Math. Geol. 32(3), 271–275 (2000)MATHCrossRef
3.
Zurück zum Zitat Barsalou, L.W.: Deriving categories to achieve goals. In: Psychology of Learning and Motivation. Elsevier (1991) Barsalou, L.W.: Deriving categories to achieve goals. In: Psychology of Learning and Motivation. Elsevier (1991)
5.
Zurück zum Zitat Bauckhage, C., Thurau, C.: Making archetypal analysis practical. In: DAGM-Symposium, pp. 272–281 (2009) Bauckhage, C., Thurau, C.: Making archetypal analysis practical. In: DAGM-Symposium, pp. 272–281 (2009)
6.
Zurück zum Zitat Chang, F., Lin, C.C., Lu, C.J.: Adaptive prototype learning algorithms: theoretical and experimental studies. J. Mach. Learn. Res. 7, 2125–2148 (2006)MathSciNetMATH Chang, F., Lin, C.C., Lu, C.J.: Adaptive prototype learning algorithms: theoretical and experimental studies. J. Mach. Learn. Res. 7, 2125–2148 (2006)MathSciNetMATH
7.
Zurück zum Zitat Cowan, N.: The magical mystery four: how is working memory capacity limited, and why? Curr. Dir. Psychol. Sci. 19(1), 51–57 (2010)CrossRef Cowan, N.: The magical mystery four: how is working memory capacity limited, and why? Curr. Dir. Psychol. Sci. 19(1), 51–57 (2010)CrossRef
9.
Zurück zum Zitat D’Esposito, M., Palumbo, F., Ragozini, G.: Interval archetypes: a new tool for interval data analysis. Stat. Anal. Data Min. 5(4), 322–335 (2012)MathSciNetCrossRef D’Esposito, M., Palumbo, F., Ragozini, G.: Interval archetypes: a new tool for interval data analysis. Stat. Anal. Data Min. 5(4), 322–335 (2012)MathSciNetCrossRef
10.
Zurück zum Zitat D’Esposito, M.R., Palumbo, F., Ragozini, G.: Archetypal Symbolic Objects, pp. 41–49. Springer, Berlin (2013)CrossRef D’Esposito, M.R., Palumbo, F., Ragozini, G.: Archetypal Symbolic Objects, pp. 41–49. Springer, Berlin (2013)CrossRef
12.
Zurück zum Zitat Diday, E.: Categorization in symbolic data analysis. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 845–867. Elsevier Science Ltd, Oxford (2005)CrossRef Diday, E.: Categorization in symbolic data analysis. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 845–867. Elsevier Science Ltd, Oxford (2005)CrossRef
13.
Zurück zum Zitat Edelsbrunner, H.: Algorithms in Combinatorial Geometry, vol. 10. Springer-Verlag, Berlin (1987)MATHCrossRef Edelsbrunner, H.: Algorithms in Combinatorial Geometry, vol. 10. Springer-Verlag, Berlin (1987)MATHCrossRef
14.
Zurück zum Zitat Eugster, M., Leisch, F., Seth, S.: Archetypes: archetypal analysis. In: R Package Version, pp. 2–2 (2014) Eugster, M., Leisch, F., Seth, S.: Archetypes: archetypal analysis. In: R Package Version, pp. 2–2 (2014)
15.
Zurück zum Zitat Fordellone, M., Palumbo, F.: Prototypes definition through consensus analysis between fuzzy c-means and archetypal analysis. Ital. J. Appl. Stat. 26(2), 141–162 (2014) Fordellone, M., Palumbo, F.: Prototypes definition through consensus analysis between fuzzy c-means and archetypal analysis. Ital. J. Appl. Stat. 26(2), 141–162 (2014)
16.
Zurück zum Zitat Forina, M., Armanino, C., Castino, M., Ubigli, M.: Multivariate data analysis as a discriminating method of the origin of wines. Vitis 25, 189–201 (1986) Forina, M., Armanino, C., Castino, M., Ubigli, M.: Multivariate data analysis as a discriminating method of the origin of wines. Vitis 25, 189–201 (1986)
17.
Zurück zum Zitat Frigui, H., Nasraoui, O.: Unsupervised learning of prototypes and attribute weights. Pattern Recognit. 37(3), 567–581 (2004)CrossRef Frigui, H., Nasraoui, O.: Unsupervised learning of prototypes and attribute weights. Pattern Recognit. 37(3), 567–581 (2004)CrossRef
18.
Zurück zum Zitat Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. A Bradford Book, vol. 3, p. 16. MIT Press (2000) Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. A Bradford Book, vol. 3, p. 16. MIT Press (2000)
19.
Zurück zum Zitat Gärdenfors, P.: Concept Learning and Nonmonotonic Reasoning, pp. 823–843. Elsevier (2005) Gärdenfors, P.: Concept Learning and Nonmonotonic Reasoning, pp. 823–843. Elsevier (2005)
21.
Zurück zum Zitat Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2011) Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2011)
22.
Zurück zum Zitat Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)CrossRef Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)CrossRef
24.
Zurück zum Zitat Lesot, M., Kruse, R.: Typicality degrees and fuzzy prototypes for clustering. In: Advances in Data Analysis, pp. 107–114. Springer (2007) Lesot, M., Kruse, R.: Typicality degrees and fuzzy prototypes for clustering. In: Advances in Data Analysis, pp. 107–114. Springer (2007)
25.
Zurück zum Zitat Lesot, M., Rifqi, M., Bouchon-Meunier, B.: Fuzzy prototypes: from a cognitive view to a machine learning principle. In: Fuzzy Sets and Their Extensions: Representation, Aggregation and Models, pp. 431–452. Springer (2008) Lesot, M., Rifqi, M., Bouchon-Meunier, B.: Fuzzy prototypes: from a cognitive view to a machine learning principle. In: Fuzzy Sets and Their Extensions: Representation, Aggregation and Models, pp. 431–452. Springer (2008)
26.
Zurück zum Zitat Okabe, A., Boots, B., Sugihara, K.: Nearest neighbourhood operations with generalized voronoi diagrams: a review. Int. J. Geogr. Inf. Syst. 8(1), 43–71 (1994)CrossRef Okabe, A., Boots, B., Sugihara, K.: Nearest neighbourhood operations with generalized voronoi diagrams: a review. Int. J. Geogr. Inf. Syst. 8(1), 43–71 (1994)CrossRef
27.
Zurück zum Zitat Papadopoulou, E., Lee, D.T.: The Hausdorff Voronoi diagram of polygonal objects: a divide and conquer approach. Int. J. Comput. Geom. Appl. 14(06), 421–452 (2004)MathSciNetMATHCrossRef Papadopoulou, E., Lee, D.T.: The Hausdorff Voronoi diagram of polygonal objects: a divide and conquer approach. Int. J. Comput. Geom. Appl. 14(06), 421–452 (2004)MathSciNetMATHCrossRef
28.
Zurück zum Zitat Porzio, G.C., Ragozini, G., Vistocco, D.: On the use of archetypes as benchmarks. Appl. Stoch. Model. Bus. Ind. 25, 419–437 (2008)MathSciNetMATHCrossRef Porzio, G.C., Ragozini, G., Vistocco, D.: On the use of archetypes as benchmarks. Appl. Stoch. Model. Bus. Ind. 25, 419–437 (2008)MathSciNetMATHCrossRef
29.
Zurück zum Zitat Ragozini, G., Palumbo, F., D’Esposito, M.R.: Archetypal analysis for data-driven prototype identification. Stat. Anal. Data Min.: ASA Data Sci. J. (2016) Ragozini, G., Palumbo, F., D’Esposito, M.R.: Archetypal analysis for data-driven prototype identification. Stat. Anal. Data Min.: ASA Data Sci. J. (2016)
31.
Zurück zum Zitat Robert, S.: Categorization, reasoning and memory from a neo-logical point of view. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 700-718. Elsevier (2005) Robert, S.: Categorization, reasoning and memory from a neo-logical point of view. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 700-718. Elsevier (2005)
32.
33.
Zurück zum Zitat Rosch, E.: Prototype classification and logical classification: the two systems. In: New Trends in Conceptual Representation: Challenges to Piaget’s Theory, pp. 73–86 (1983) Rosch, E.: Prototype classification and logical classification: the two systems. In: New Trends in Conceptual Representation: Challenges to Piaget’s Theory, pp. 73–86 (1983)
35.
Zurück zum Zitat Watson, D.F.: Computing the n-dimensional delaunay tessellation with application to Voronoi polytopes. Comput. J. 24(2), 167–172 (1981)MathSciNetCrossRef Watson, D.F.: Computing the n-dimensional delaunay tessellation with application to Voronoi polytopes. Comput. J. 24(2), 167–172 (1981)MathSciNetCrossRef
36.
Zurück zum Zitat Zenker, F., Gärdenfors, P. (ed.): Applications of Conceptual Spaces: The Case for Geometric Knowledge Representation. Springer, Berlin; Synthese Library: Studies in Epistemology, Logic, Methodology, and Philosophy of Science. 24(2), 2–10. Springer (2015) Zenker, F., Gärdenfors, P. (ed.): Applications of Conceptual Spaces: The Case for Geometric Knowledge Representation. Springer, Berlin; Synthese Library: Studies in Epistemology, Logic, Methodology, and Philosophy of Science. 24(2), 2–10. Springer (2015)
Metadaten
Titel
Statistical Archetypal Analysis for Cognitive Categorization
verfasst von
Francesco Santelli
Francesco Palumbo
Giancarlo Ragozini
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-21158-5_7