Skip to main content
Erschienen in: Quality & Quantity 4/2014

01.07.2014

Visual displays: analytical study and applications to graphs and real data

verfasst von: K. Fernández-Aguirre, M. A. Garín-Martín, J. I. Modroño-Herrán

Erschienen in: Quality & Quantity | Ausgabe 4/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Principal axis methods such as principal component analysis (PCA) and correspondence analysis (CA) are useful for identifying structures in data through interesting planar graphic displays. However, some kinds of data sets can be dealt alternatively with PCA or CA. This paper focuses on methods, such as PCA and CA, and on visual displays. Our aim is to illustrate the implications for a potential user of selecting either method, and its advantages and disadvantages, from an applied point of view. This is a matter covered broadly in textbooks and elsewhere considering theoretical arguments. Our purpose is to contribute to the comparison between these methods, over the same data set, in order to illustrate them for the practitioner. In the first part of this paper we present a novel analytical study of a binary matrix associated with a non-oriented axis-symmetric graph and show that CA outperforms standardized PCA for the reconstitution and visualization of such kind of graphs. In the second part we present a case using real data dealing with the distribution of employees in different economic sectors for the countries of the European Union, analyzed by means of standardized PCA and two-way CA, in order to see the differences between the two methods in practice.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Bécue-Bertaut, M., Fernández-Aguirre, K., Modroño-Herran, J.I.: Analysis of a mixture of closed and open-ended questions in the case of a multilingual survey. In: Skiadas, C.H. (ed.) Advances in data analysis: theory and applications to reliability and inference, data mining, bioinformatics, lifetime data, and neural networks, pp. 23–34. Birkhäuser, Boston (2009) Bécue-Bertaut, M., Fernández-Aguirre, K., Modroño-Herran, J.I.: Analysis of a mixture of closed and open-ended questions in the case of a multilingual survey. In: Skiadas, C.H. (ed.) Advances in data analysis: theory and applications to reliability and inference, data mining, bioinformatics, lifetime data, and neural networks, pp. 23–34. Birkhäuser, Boston (2009)
Zurück zum Zitat Beh, E.: Simple correspondence analysis: a bibliographic review. Int. Stat. Rev. 72(2), 257–284 (2004a)CrossRef Beh, E.: Simple correspondence analysis: a bibliographic review. Int. Stat. Rev. 72(2), 257–284 (2004a)CrossRef
Zurück zum Zitat Benzécri, J.P.: L’Analyse des Données, La Taxinomie (T. I); L’Analyse des Correspondances (T. II). Dunod, Paris (1973) Benzécri, J.P.: L’Analyse des Données, La Taxinomie (T. I); L’Analyse des Correspondances (T. II). Dunod, Paris (1973)
Zurück zum Zitat Blasius, J., Greenacre, M., Groenen, P.J.F., van de Velden, M.: Special issue on correspondence analysis and related methods. Comput. Stat. Data Anal. 53(8), 3103–3257 (2009)CrossRef Blasius, J., Greenacre, M., Groenen, P.J.F., van de Velden, M.: Special issue on correspondence analysis and related methods. Comput. Stat. Data Anal. 53(8), 3103–3257 (2009)CrossRef
Zurück zum Zitat D’Ambra, L., Lauro, N.: Non symmetrical analysis of three way contingency tables. In: Coppi, R., Bolasco, S. (eds.) Multiway data analysis, pp. 301–315. North-Holland Publishing Co., Amsterdam (1989) D’Ambra, L., Lauro, N.: Non symmetrical analysis of three way contingency tables. In: Coppi, R., Bolasco, S. (eds.) Multiway data analysis, pp. 301–315. North-Holland Publishing Co., Amsterdam (1989)
Zurück zum Zitat Escofier, B., Pagès, J.: Analyses factorielles simples et multiples Objectifs, méthodes et interprétation, 4th edn. Dunod, Paris (2008) Escofier, B., Pagès, J.: Analyses factorielles simples et multiples Objectifs, méthodes et interprétation, 4th edn. Dunod, Paris (2008)
Zurück zum Zitat Fernández-Aguirre, K.: Multiple correspondence analysis and related methods. J. Appl. Stat. 34(7), 879–885 (2007)CrossRef Fernández-Aguirre, K.: Multiple correspondence analysis and related methods. J. Appl. Stat. 34(7), 879–885 (2007)CrossRef
Zurück zum Zitat Gifi, A.: Non linear multivariate analysis. Wiley, Chichester (1990) Gifi, A.: Non linear multivariate analysis. Wiley, Chichester (1990)
Zurück zum Zitat Greenacre, M.J.: Theory and applications of correspondence analysis. Academic Press, London (1984) Greenacre, M.J.: Theory and applications of correspondence analysis. Academic Press, London (1984)
Zurück zum Zitat Greenacre, M.J., Blasius, J. (eds.): Multiple correspondence analysis and related methods. Chapman & Hall/CRC, London (2006) Greenacre, M.J., Blasius, J. (eds.): Multiple correspondence analysis and related methods. Chapman & Hall/CRC, London (2006)
Zurück zum Zitat Jolliffe, I.T.: Principal component analysis and exploratory factor analysis. Stat. Methods Med. Res. 1(1), 69–95 (1992)CrossRef Jolliffe, I.T.: Principal component analysis and exploratory factor analysis. Stat. Methods Med. Res. 1(1), 69–95 (1992)CrossRef
Zurück zum Zitat Jolliffe, I.T.: Principal component analysis. Springer series in statistics, 2nd edn. Springer, New York (2002) Jolliffe, I.T.: Principal component analysis. Springer series in statistics, 2nd edn. Springer, New York (2002)
Zurück zum Zitat Lauro, N., D’Ambra, L.: L’Analyse non symétrique des correspondances. Data Anal. Inform. III, 433–446 (1984) Lauro, N., D’Ambra, L.: L’Analyse non symétrique des correspondances. Data Anal. Inform. III, 433–446 (1984)
Zurück zum Zitat Le Roux, B., Rouanet, H.: Geometric data analysis: from correspondence analysis to structured data analysis. Kluwer Academic Publishers, Dordrecht (2004) Le Roux, B., Rouanet, H.: Geometric data analysis: from correspondence analysis to structured data analysis. Kluwer Academic Publishers, Dordrecht (2004)
Zurück zum Zitat Le Roux, B., Rouanet, H.: Multiple correspondence analysis, quantitative applications in the social sciences series, vol. 163. SAGE Publications, Thousand Oaks (2010)CrossRef Le Roux, B., Rouanet, H.: Multiple correspondence analysis, quantitative applications in the social sciences series, vol. 163. SAGE Publications, Thousand Oaks (2010)CrossRef
Zurück zum Zitat Lebart, L.: Exploratory analysis of large sparse matrices, with application to textual data, COMPSTAT. Physica Verlag, Vienna (1982) Lebart, L.: Exploratory analysis of large sparse matrices, with application to textual data, COMPSTAT. Physica Verlag, Vienna (1982)
Zurück zum Zitat Lebart, L., Morineau, A., Warwick, K.: Multivariate descriptive statistical analysis. Wiley, New York (1984) Lebart, L., Morineau, A., Warwick, K.: Multivariate descriptive statistical analysis. Wiley, New York (1984)
Zurück zum Zitat Lebart, L., Salem, A., Berry, L.: Exploring textual data. Kluwer Academic Publishers, New York (1998)CrossRef Lebart, L., Salem, A., Berry, L.: Exploring textual data. Kluwer Academic Publishers, New York (1998)CrossRef
Zurück zum Zitat Lebart, L., Tabard, N.: Recherches sur la description automatique des données socio-économiques, (CORDES-DGRST, 1973 - CR No. 13/1971). Presented at the third European meeting of the Psychometric Society, Jouy-en-Josas, France, July (1983). Lebart, L., Tabard, N.: Recherches sur la description automatique des données socio-économiques, (CORDES-DGRST, 1973 - CR No. 13/1971). Presented at the third European meeting of the Psychometric Society, Jouy-en-Josas, France, July (1983).
Zurück zum Zitat Murtagh, F.: Correspondence analysis and data coding with Java and R. Chapman and Hall/CRC, Boca Raton (2005)CrossRef Murtagh, F.: Correspondence analysis and data coding with Java and R. Chapman and Hall/CRC, Boca Raton (2005)CrossRef
Zurück zum Zitat Murtagh, F.: Sparse image and signal processing. Cambridge University Press, Cambridge (2010) Murtagh, F.: Sparse image and signal processing. Cambridge University Press, Cambridge (2010)
Zurück zum Zitat Pham, N.-K., Morin, A., Gros, P., Le, Q.-T.: Intensive use of factorial correspondence analysis for large scale content-based image retrieval. In: Guillet, F., Ritschard, G., Zighed, D.A., Briand, H. (eds.) Advances in knowledge discovery and management, AKDM’09. Studies in computational intelligence, vol. 292, pp. 57–76. Springer, Berlin/Heidelberg (2010) Pham, N.-K., Morin, A., Gros, P., Le, Q.-T.: Intensive use of factorial correspondence analysis for large scale content-based image retrieval. In: Guillet, F., Ritschard, G., Zighed, D.A., Briand, H. (eds.) Advances in knowledge discovery and management, AKDM’09. Studies in computational intelligence, vol. 292, pp. 57–76. Springer, Berlin/Heidelberg (2010)
Zurück zum Zitat R Development Core Team: R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. ISBN 3-900051-07-0. http://www.R-project.org (2011) R Development Core Team: R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. ISBN 3-900051-07-0. http://​www.​R-project.​org (2011)
Zurück zum Zitat Reed, K.: Multiple correspondence analysis and related methods. Comput. Stat. Data Anal. 53(8), 3245–3257 (2009)CrossRef Reed, K.: Multiple correspondence analysis and related methods. Comput. Stat. Data Anal. 53(8), 3245–3257 (2009)CrossRef
Zurück zum Zitat Shepard, R., Carroll, J.D.: Parametric representation of nonlinear data structures. In: Krishnaiah, P.R. (ed.) Multivariate analysis, pp. 561–592. Academic Press, New York (1966) Shepard, R., Carroll, J.D.: Parametric representation of nonlinear data structures. In: Krishnaiah, P.R. (ed.) Multivariate analysis, pp. 561–592. Academic Press, New York (1966)
Zurück zum Zitat Tenenhaus, M., Young, F.W.: An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data. Psychometrika 21, 91–119 (1985)CrossRef Tenenhaus, M., Young, F.W.: An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data. Psychometrika 21, 91–119 (1985)CrossRef
Zurück zum Zitat von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)CrossRef von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)CrossRef
Zurück zum Zitat Zárraga, A., Goitisolo, B.: Simultaneous analysis and multiple factor analysis for contingency tables: two methods for the joint study of contingency tables. Comput. Stat. Data Anal. 53(8), 3171–3182 (2009)CrossRef Zárraga, A., Goitisolo, B.: Simultaneous analysis and multiple factor analysis for contingency tables: two methods for the joint study of contingency tables. Comput. Stat. Data Anal. 53(8), 3171–3182 (2009)CrossRef
Metadaten
Titel
Visual displays: analytical study and applications to graphs and real data
verfasst von
K. Fernández-Aguirre
M. A. Garín-Martín
J. I. Modroño-Herrán
Publikationsdatum
01.07.2014
Verlag
Springer Netherlands
Erschienen in
Quality & Quantity / Ausgabe 4/2014
Print ISSN: 0033-5177
Elektronische ISSN: 1573-7845
DOI
https://doi.org/10.1007/s11135-013-9887-4

Weitere Artikel der Ausgabe 4/2014

Quality & Quantity 4/2014 Zur Ausgabe

Premium Partner