Abstract
Transforming complex multivariate geological data to a Gaussian distribution is an important and challenging problem in geostatistics. A variety of transforms are available for this goal, but struggle with high dimensional data sets. Projection pursuit density estimation (PPDE) is a well-established nonparametric method for estimating the joint density of multivariate data. A central component of the PPDE algorithm transforms the original data toward a multivariate Gaussian distribution. The PPDE approach is modified to map complex data to a multivariate Gaussian distribution within a geostatistical modeling context. Traditional modeling may then take place on the transformed Gaussian data, with a back-transform used to return simulated variables to their original units. This approach is referred to as the projection pursuit multivariate transform (PPMT). The PPMT shows the potential to be an effective means for modeling high dimensional and complex geologic data. The PPMT algorithm is developed before discussing considerations and limitations. A case study compares modeling results against more common techniques to demonstrate the value and place of the PPMT within geostatistics.
Similar content being viewed by others
References
Almeida AS, Journel AG (1994) Joint simulation of multiple-variables with a Markov-type coregionalization model. Math Geol 26:565–588. doi:10.1007/Bf02089242
Babak O, Deutsch CV (2009) An intrinsic model of coregionalization that solves variance inflation in collocated cokriging. Comput Geosci 35:603–614
Barnett RM, Deutsch CV (2012) Practical implementation of non-linear transforms for modeling geometallurgical variables. In: Abrahamsen P, Hauge R, Kolbjornsen O (eds) Geostatistics Oslo 2012. Springer, Netherlands
Bellman RE (1957) Dynamic programming. Princeton University Press, New Jersey
Bliss C (1934) The method of probits. Science 79:39
Desbarats AJ, Dimitrakopoulos R (2000) Geostatistical simulation of regionalized poresize distributions using min/max autocorrelations factors. Math Geol 32:919–942
Deutsch CV, Journel AG (1998) GSLIB: a geostatistical software library and user’s guide, 2nd edn. Oxford University Press, New York
Friedman JH (1987) Exploratory projection pursuit. J Am Stat Assoc 82:249–266
Friedman JH, Tukey JW (1974) A projection pursuit algorithm for exploratory data analysis. IEEE Trans Comput C-23:881–890
Goovaerts P (1993) Spatial orthogonality of the principal components computed from coregionalized variables. Math Geol 25:281–302
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417–441
Huber PJ (1985) Projection pursuit. Ann Stat 13:435–475
Hwang J, Lay S, Lippman A (1994) Nonparametric multivariate density estimation: a comparative study. IEEE Trans Signal Process 42:2795–2810
Isaaks EH (1990) The application of Monte Carlo methods to the analysis of spatially correlated data. PhD dissertation, Stanford University, Stanford, USA
Johnson RJ, Wichern DW (1998) Applied multivariate statistical analysis, 4th edn. Prentice Hall, New Jersey
Johnston LP, Kramer MA (2011) Probability density estimation using elliptic basis functions. AIChE J 40:1639–1649
Journel AG, Huijbregts CJ (1978) Mining geostatistics. Academic Press, London
Leuangthong O, Deutsch CV (2003) Stepwise conditional transformation for simulation of multiple variables. Math Geol 35:155–173
Little RJ, Rubin DB (2002) Statistical analysis with missing data, 2nd edn. Wiley, New Jersey
Liu GR, Gu YT (2005) An introduction to meshfree methods and their programming. Springer, Netherlands
Martin-Fernandez JA, Barcelo-Vidal C, Pawlowsky-Glahn V (2003) Dealing with zeros and missing values in compositional data sets using nonparametric imputation. Math Geol 35:253–278
Mueller UA, Ferreira J (2012) The U-WEDGE transformation method for multivariate geostatistical simulation. Math Geosci 44:427–448
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33:1065–1076
Rosenblatt M (1952) Remarks on a multivariate transformation. Ann Math Stat 23:470–472
Rosenblatt M (1956) Remarks on some nonparametric estimates of a density function. Ann Math Stat 27:832–837
Scott DW (1992) Multivariate density estimation: theory, practice, and visualization. Wiley, New York
Switzer P, Green AA (1984) Min/Max autocorrelation factors for multivariate spatial imaging. Department of Statistics technical report 6, Stanford University, Stanford, USA
Verly GW (1984) Estimation of spatial point and block distributions: the multi-Gaussian model. PhD dissertation, Stanford University, Stanford, USA
Zhu H, Journel AG (1993) Formatting and integrating soft data: stochastic imaging via the Markov–Bayes algorithm. In: Soares A (ed) Geostatistics Troia 1992. Springer, Netherlands
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Barnett, R.M., Manchuk, J.G. & Deutsch, C.V. Projection Pursuit Multivariate Transform. Math Geosci 46, 337–359 (2014). https://doi.org/10.1007/s11004-013-9497-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11004-013-9497-7