Skip to main content

2016 | OriginalPaper | Buchkapitel

Discovering Spatially Contiguous Clusters in Multivariate Geostatistical Data Through Spectral Clustering

verfasst von : Francky Fouedjio

Erschienen in: Advanced Data Mining and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Spectral clustering has recently become one of the most popular modern clustering algorithms for traditional data. However, the application of this clustering method on geostatistical data produces spatially scattered clusters, which is undesirable for many geoscience applications. In this work, we develop a spectral clustering method aimed to discover spatially contiguous and meaningful clusters in multivariate geostatistical data, in which spatial dependence plays an important role. The proposed spectral clustering method relies on a similarity measure built from a non-parametric kernel estimator of the multivariate spatial dependence structure of the data, emphasizing the spatial correlation among data locations. The capability of the proposed spectral clustering method to provide spatially contiguous and meaningful clusters is illustrated using the European Geological Surveys Geochemical database.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Allard, D.: Geostatistical classification and class kriging. J. Geog. Inf. Decis. Anal. 2, 87–101 (1998) Allard, D.: Geostatistical classification and class kriging. J. Geog. Inf. Decis. Anal. 2, 87–101 (1998)
2.
Zurück zum Zitat Allard, D., Guillot, G.: Clustering geostatistical data. In: Proceedings of the Sixth Geostatistical Conference (2000) Allard, D., Guillot, G.: Clustering geostatistical data. In: Proceedings of the Sixth Geostatistical Conference (2000)
3.
Zurück zum Zitat Allard, D., Monestiez, P.: Geostatistical segmentation of rainfall data. In: geoENV II: Geostatistics for Environmental Applications, pp. 139–150 (1999) Allard, D., Monestiez, P.: Geostatistical segmentation of rainfall data. In: geoENV II: Geostatistics for Environmental Applications, pp. 139–150 (1999)
4.
Zurück zum Zitat Ambroise, C., Dang, M., Govaert, G.: Clustering of spatial data by the EM algorithm. In: geoENV I: Geostatistics for Environmental Applications, pp. 493–504 (1995) Ambroise, C., Dang, M., Govaert, G.: Clustering of spatial data by the EM algorithm. In: geoENV I: Geostatistics for Environmental Applications, pp. 493–504 (1995)
5.
Zurück zum Zitat Bourgault, G., Marcotte, D., Legendre, P.: The multivariate (co)variogram as a spatial weighting function in classification methods. Math. Geol. 24(5), 463–478 (1992)CrossRef Bourgault, G., Marcotte, D., Legendre, P.: The multivariate (co)variogram as a spatial weighting function in classification methods. Math. Geol. 24(5), 463–478 (1992)CrossRef
6.
Zurück zum Zitat Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. 3(1), 1–27 (1974)MathSciNetMATH Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. 3(1), 1–27 (1974)MathSciNetMATH
7.
8.
Zurück zum Zitat Charu, C., Chandan, K.: Data Clustering: Algorithms and Applications. Chapman and Hall/CRC, Boca Raton (2013)MATH Charu, C., Chandan, K.: Data Clustering: Algorithms and Applications. Chapman and Hall/CRC, Boca Raton (2013)MATH
9.
Zurück zum Zitat Chilès, J.P., Delfiner, P.: Geostatistics: Modeling Spatial Uncertainty. Wiley, Hoboken (2012)CrossRefMATH Chilès, J.P., Delfiner, P.: Geostatistics: Modeling Spatial Uncertainty. Wiley, Hoboken (2012)CrossRefMATH
10.
Zurück zum Zitat Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via EM algorithm (with discussion). J. Roy. Stat. Soc. Ser. 39, 1–38 (1977)MathSciNetMATH Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via EM algorithm (with discussion). J. Roy. Stat. Soc. Ser. 39, 1–38 (1977)MathSciNetMATH
11.
Zurück zum Zitat Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering. Pattern Recogn. 41(1), 176–190 (2008)CrossRefMATH Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering. Pattern Recogn. 41(1), 176–190 (2008)CrossRefMATH
12.
Zurück zum Zitat Fouedjio, F.: A clustering approach for discovering intrinsic clusters in multivariate geostatistical data. In: Perner, P. (ed.) MLDM 2016. LNCS, vol. 9729, pp. 491–500. Springer, Switzerland (2016)CrossRef Fouedjio, F.: A clustering approach for discovering intrinsic clusters in multivariate geostatistical data. In: Perner, P. (ed.) MLDM 2016. LNCS, vol. 9729, pp. 491–500. Springer, Switzerland (2016)CrossRef
13.
Zurück zum Zitat Fouedjio, F.: A hierarchical clustering method for multivariate geostatistical data. Spatial Statistics (2016) Fouedjio, F.: A hierarchical clustering method for multivariate geostatistical data. Spatial Statistics (2016)
14.
Zurück zum Zitat Guillot, G., Kan-King-Yu, D., Michelin, J., Huet, P.: Inference of a hidden spatial tessellation from multivariate data: application to the delineation of homogeneous regions in an agricultural field. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 55(3), 407–430 (2006)MathSciNetCrossRefMATH Guillot, G., Kan-King-Yu, D., Michelin, J., Huet, P.: Inference of a hidden spatial tessellation from multivariate data: application to the delineation of homogeneous regions in an agricultural field. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 55(3), 407–430 (2006)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Haas, T.C.: Lognormal and moving window methods of estimating acid deposition. J. Am. Stat. Assoc. 85(412), 950–963 (1990)CrossRef Haas, T.C.: Lognormal and moving window methods of estimating acid deposition. J. Am. Stat. Assoc. 85(412), 950–963 (1990)CrossRef
16.
Zurück zum Zitat Journel, A., Huijbregts, C.: Mining Geostatistics. Blackburn Press, New York (2003) Journel, A., Huijbregts, C.: Mining Geostatistics. Blackburn Press, New York (2003)
18.
Zurück zum Zitat Lado, L., Hengl, T., Reuter, I.: Heavy metals in European soils: a geostatistical analysis of the FOREGS geochemical database. Geoderma 148(2), 189–199 (2008)CrossRef Lado, L., Hengl, T., Reuter, I.: Heavy metals in European soils: a geostatistical analysis of the FOREGS geochemical database. Geoderma 148(2), 189–199 (2008)CrossRef
21.
Zurück zum Zitat Luxburg, U.V., Bousquet, O., Belkin, M.: Limits of spectral clustering. In: Advances in Neural Information Processing Systems, pp. 857–864 (2004) Luxburg, U.V., Bousquet, O., Belkin, M.: Limits of spectral clustering. In: Advances in Neural Information Processing Systems, pp. 857–864 (2004)
22.
Zurück zum Zitat Nascimento, M.C., Carvalho, A.C.: Spectral methods for graph clustering – a survey. Eu. J. Oper. Res. 211(2), 221–231 (2011)MathSciNetCrossRefMATH Nascimento, M.C., Carvalho, A.C.: Spectral methods for graph clustering – a survey. Eu. J. Oper. Res. 211(2), 221–231 (2011)MathSciNetCrossRefMATH
23.
Zurück zum Zitat Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856. MIT Press (2001) Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856. MIT Press (2001)
24.
Zurück zum Zitat Olivier, M., Webster, R.: A geostatistical basis for spatial weighting in multivariate classification. Math. Geol. 21, 15–35 (1989)CrossRef Olivier, M., Webster, R.: A geostatistical basis for spatial weighting in multivariate classification. Math. Geol. 21, 15–35 (1989)CrossRef
25.
Zurück zum Zitat Pawitan, Y., Huang, J.: Constrained clustering of irregularly sampled spatial data. J. Stat. Comput. Simul. 73(12), 853–865 (2003)MathSciNetCrossRefMATH Pawitan, Y., Huang, J.: Constrained clustering of irregularly sampled spatial data. J. Stat. Comput. Simul. 73(12), 853–865 (2003)MathSciNetCrossRefMATH
26.
Zurück zum Zitat Romary, T., Ors, F., Rivoirard, J., Deraisme, J.: Unsupervised classification of multivariate geostatistical data: two algorithms. Comput. Geosci. 85, 96–103 (2015)CrossRef Romary, T., Ors, F., Rivoirard, J., Deraisme, J.: Unsupervised classification of multivariate geostatistical data: two algorithms. Comput. Geosci. 85, 96–103 (2015)CrossRef
27.
28.
Zurück zum Zitat Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, New York (2009)MATH Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, New York (2009)MATH
29.
Zurück zum Zitat Tobler, W.R.: A computer movie simulating urban growth in the Detroit region. Econ. Geogr. 46, 234–240 (1970)CrossRef Tobler, W.R.: A computer movie simulating urban growth in the Detroit region. Econ. Geogr. 46, 234–240 (1970)CrossRef
30.
Zurück zum Zitat Wand, M., Jones, C.: Kernel Smoothing. Monographs on Statistics and Applied Probability. Chapman & Hall, Sanford (1995)CrossRef Wand, M., Jones, C.: Kernel Smoothing. Monographs on Statistics and Applied Probability. Chapman & Hall, Sanford (1995)CrossRef
31.
Zurück zum Zitat Zha, H., He, X., Ding, C., Gu, M., Simon, H.D.: Spectral relaxation for k-means clustering. In: Advances in Neural Information Processing Systems, pp. 1057–1064 (2001) Zha, H., He, X., Ding, C., Gu, M., Simon, H.D.: Spectral relaxation for k-means clustering. In: Advances in Neural Information Processing Systems, pp. 1057–1064 (2001)
Metadaten
Titel
Discovering Spatially Contiguous Clusters in Multivariate Geostatistical Data Through Spectral Clustering
verfasst von
Francky Fouedjio
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-49586-6_38