Selection of coherent deposit-type locations and their application in data-driven mineral prospectivity mapping

https://doi.org/10.1016/j.oregeorev.2007.07.001Get rights and content

Abstract

Data-driven prospectivity mapping can be undermined by dissimilarity in multivariate spatial data signatures of deposit-type locations. Most cases of data-driven prospectivity mapping, however, make use of training sets of randomly selected deposit-type locations with the implicit assumption that they are coherent (i.e., with similar multivariate spatial data signatures). This study shows that the quality of data-driven prospectivity mapping can be improved by using a training set of coherent deposit-type locations. Analysis and selection of coherent deposit-type locations was performed via logistic regression, by using multiple sets of deposit occurrence favourability scores of univariate geoscience spatial data as independent variables and binary deposit occurrence scores as dependent variable. The set of coherent deposit-type locations and three sets of randomly selected deposit-type locations were each used in data-driven prospectivity mapping via application of evidential belief functions. The prospectivity map based on the training set of coherent deposit-type locations resulted in lower uncertainty, better goodness-of-fit to the training set, and better predictive capacity against a cross-validation set of economic deposits of the type sought. This study shows that explicit selection of training set of coherent deposit-type locations should be applied in data-driven prospectivity mapping.

Introduction

Known deposit-type locations are samples of a mineralized landscape. They are used in data-driven prospectivity mapping to predict sampling (exploration) targets for undiscovered deposit-type locations in the mineralized landscape. After creation of a GIS geoscience spatial database, there are three stages in data-driven prospectivity mapping: (1) selection of training deposit-type locations; (2) creation of prospectivity map; and (3) cross-validation of prospectivity map. In the first stage, the common practice is random selection of training deposit-type locations with the tacit assumption that they are coherent (i.e., having strongly similar characteristics). In the second stage, spatial associations between training deposit-type locations and individual evidential variables are quantified to create predictor maps, which are integrated to derive a prospectivity map. In the third stage, the prospectivity map is validated in terms of goodness-of-fit to training deposit-type locations and is cross-validated in terms of predictive capacity against deposit-type locations not used to create the predictor maps. The validation and cross-validation of a prospectivity map indicate its usefulness as a predictive tool to guide further exploration toward undiscovered deposit-type locations in the mineralized landscape.

Explicit selection of coherent training deposit-type locations is not practiced in most cases, even though it is crucial to prospectivity mapping. Every mineral deposit, even if classified into a deposit-type, is unique and has characteristics that are, to a certain extent, dissimilar to other mineral deposits of the same type. It follows that multivariate spatial data signatures of deposit-type locations are, to a certain extent, dissimilar or non-coherent. Random selection of deposit-type locations is therefore likely to result in a non-coherent training set. Dissimilarity in multivariate spatial data signatures of deposit-type locations in a non-coherent training set can undermine the quality of a prospectivity map. It is hypothesized here that uncertainty of a prospectivity map can be reduced and that goodness-of-fit and predictive capacity of a prospectivity map can be improved by using a training set of coherent deposit-type locations (i.e., with similar multivariate spatial data signatures).

This paper demonstrates a two-step methodology to select, from all known deposit-type locations in a study area, a set of coherent deposit-type locations. Then, to test and demonstrate the hypothesis stated above, performance of a prospectivity map based on the set of coherent deposit-type locations is compared and contrasted with performances of prospectivity maps based on sets of randomly selected deposit-type locations. The performance indices used are map uncertainty, goodness-of-fit to training deposit-type locations, and predictive capacity against cross-validation deposit-type locations. The proposed technique of selecting a training set of coherent deposit-type locations and the hypothesis of utility of such a training set are tested and demonstrated in data-driven mapping of prospectivity for alkalic porphyry Cu–Au deposits in British Columbia.

This paper is organized into seven sections. After this introduction, the second section describes the geology/mineralization of the test area and the geoscience spatial data sets used in the study. The third section discusses the first step in selecting coherent deposit-type locations (i.e., analysis of mineral occurrence favourability scores of univariate geoscience spatial data with respect to the deposit and non-deposit locations). The fourth section discusses the second step in selecting coherent deposit-type locations (i.e., analysis of deposit-type locations with similar multivariate spatial data signatures via logistic regression using binary deposit occurrence scores as dependent variable and multiple sets of deposit occurrence favourability scores of univariate geoscience spatial data as independent variables). The fifth section describes and compares results of prospectivity mapping using the training set of coherent deposit-type locations with those obtained using training sets of randomly selected deposit-type locations. The sixth and seventh sections provide, respectively, overall discussion and conclusions of the study.

Section snippets

The test area and geoscience spatial data sets

Most of the province of British Columbia (Fig. 1), in western Canada, forms part of the Canadian Cordillera, which in turn is part of the Cordilleran Orogenic belt in the North American continent. The Canadian Cordillera is an orogenic collage of five major fault-bounded morphogeological and physiographic belts (Fig. 1), which developed from Mesozoic to post-mid Tertiary events, and docked onto the North American craton (Gabrielse et al., 1991, McMillan, 1991a). Each belt has distinct

Mineral occurrence favourability scores of univariate geoscience spatial data

This section describes and discusses the first step in selecting coherent deposit locations by analysis of mineral occurrence favourability scores of univariate geoscience spatial data with respect to deposit and non-deposit locations.

Analysis and selection of coherent deposit locations

This section describes and discusses the second step in selecting coherent deposit locations by analysis of deposit-type locations with similar multivariate spatial data signatures.

Data-driven prospectivity mapping

Data-driven prospectivity maps can be created via [weighted] logistic regression (Chung and Agterberg, 1980, Reddy et al., 1991, Carranza and Hale, 2001). In fact, the results shown in Fig. 5, Fig. 6 indicate some non-deposit locations having multivariate spatial data signatures similar to several deposit locations and these are thus plausible prospective targets. Logistic regression was not applied here to create prospectivity maps, because of its disadvantage with missing data, which affects

Discussion

The work presented here for selection of useful training samples in predictive modeling was stimulated by the work of San Pedro and Burstein (2003) in the context of tropical cyclone prediction. They applied the concept of case-based reasoning as an effective means of solving a present problem by selecting and using past cases that closely match the current problem situation (Aamodt and Plaza, 1994). San Pedro and Burstein (2003) used fuzzy mathematics as an expert-driven tool for

Conclusions

  • Analysis of spatial associations between univariate geoscience spatial data and deposit/non-deposit locations using cumulative spatial frequency distributions is a satisfactory preliminary step in the selection of coherent deposit-type locations.

  • Logistic regression is an efficient quantitative model for classification and selection of deposit-type locations with coherent or similar multivariate spatial data signatures.

  • Application of coherent deposit-type locations reduces uncertainty and

References (63)

  • R. Black et al.

    Cratons, mobile belts, alkaline rocks and continental lithospheric mantle; the Pan-African testimony

    Journal of the Geological Society

    (1993)
  • G.F. Bonham-Carter

    Geographic Information Systems for Geoscientists, Modelling with GIS

    (1994)
  • G.F. Bonham-Carter et al.

    Weights of evidence modeling: a new approach to mapping mineral potential

  • B.N. Boots et al.

    Point Pattern Analysis

  • W.M. Brown et al.

    Artificial neural networks: a new method for mineral prospectivity mapping

    Australian Journal of Earth Sciences

    (2000)
  • N.E. Breslow et al.

    Logistic regression for two-stage case-control data

    Biometrika

    (1988)
  • E.J.M. Carranza et al.

    Logistic regression for geologically constrained mapping of gold potential, Baguio district, Philippines

    Exploration and Mining Geology

    (2001)
  • E.J.M. Carranza et al.

    Where are porphyry copper deposits spatially localized? A case study in Benguet province, Philippines

    Natural Resources Research

    (2002)
  • E.J.M. Carranza et al.

    Application of data-driven evidential belief functions to prospectivity mapping for aquamarine-bearing pegmatites, Lundazi district, Zambia

    Natural Resources Research

    (2005)
  • C.F. Chung et al.

    Regression models for estimating mineral resources from geological map data

    Mathematical Geology

    (1980)
  • D.P. Cox et al.

    Distribution of Gold in Porphyry Copper Deposits

    (1988)
  • D.R. Cox et al.

    Analysis of Binary Data

    (1989)
  • J.C. Davis

    Statistics and Data Analysis in Geology

    (1973)
  • K.M. Dawson et al.

    Regional metallogeny

  • A.P. Dempster

    Upper and lower probabilities induced by a multivalued mapping

    Annals of Mathematical Statistics

    (1967)
  • A.P. Dempster

    Generalization of Bayesian inference

    Journal of the Royal Statistical Society

    (1968)
  • P.J. Diggle

    Statistical Analysis of Spatial Point Patterns

    (1983)
  • L.J. Drew et al.

    Application of the porphyry copper/polymetallic vein kin deposit system to mineral-resource assessment in the Mátra Mountain, northern Hungary

  • H. Gabrielse et al.

    Morphogeological belts, tectonic assemblages, and terranes, Part A

  • Geoscience Data Repository

    Canadian Aeromagnetic Data Base. Geological Survey of Canada, Earth Sciences Sector, Natural Resources Canada, Government of Canada

  • Geoscience Data Repository

    Canadian Geodetic Information System. Geological Survey of Canada, Earth Sciences Sector, Natural Resources Canada, Government of Canada

  • Cited by (164)

    View all citing articles on Scopus
    View full text