Spatial analysis of remote sensing image classification accuracy

doi:10.1016/j.rse.2012.09.005

Remote Sensing of Environment

Volume 127, December 2012, Pages 237-246

https://doi.org/10.1016/j.rse.2012.09.005 Get rights and content

Abstract

The error matrix is the most common way of expressing the accuracy of remote sensing image classifications, such as land cover. However, it and the measures that can be calculated from it have been criticised for not providing any indication of the spatial distribution of errors. Other research has identified the need for methods to analyse the spatial non-stationarity of error and to visualise the spatial variation in classification uncertainty. This research uses geographically weighted approaches to model the spatial variations in the accuracy of both (crisp) Boolean and (soft) fuzzy land cover classes. Remotely sensed data were classified using a maximum likelihood classifier and a fuzzy classifier to predict Boolean and fuzzy land cover classes respectively. Field data were collected at sub-pixel locations and used to generate soft and crisp validation data. A Geographically Weighted Regression was used to analyse spatial variations in the relationships between observations of Boolean land cover in the field and land cover classified from remote sensing imagery. A geographically weighted difference measure was used to analyse spatial variations in fuzzy land cover accuracy. Maps of the spatial distribution of accuracy were created for fuzzy and Boolean classes. This research demonstrates that data collected as part of a standard remote sensing validation exercise can be used to estimate mapped, spatial distributions of accuracy that would augment standard accuracy measures reported in the error matrix. It suggests that geographically weighted approaches, and the spatially explicit representations of accuracy they support, offer the opportunity to report land cover accuracy in a more informative way.

Highlights

► The confusion matrix provides no information on the spatial distribution of errors. ► The spatial distribution of correspondence provides richer accuracy information. ► Geographically weighted models were used to map Boolean and Fuzzy accuracy. ► This is a methodological advance in accuracy assessment in remote sensing.

Introduction

Land cover information can be generated through the classification of remotely sensed data. Areas or pixels with similar spectral characteristics are allocated to classes or categories each of which represents a different type of land cover feature. It is a process of generalisation, and involves a number of choices about image type, resolution, number and types of classes, training sites, etc. (Campbell, 2007). Assessing map accuracy in an objective manner is fundamental to most land cover mapping projects (Foody, 2002, Strahler et al., 2006). The accepted paradigm for doing this is through comparison with some alternative data in order to determine measures of accuracy which “express the degree of ‘correctness’ of a map or classification” (Foody, 2002, p186). Determining the accuracy of land cover classified from remotely sensed data is important. Land cover is an input into environmental models incorporating land-atmosphere interactions (GLP, 2005) and land cover change is a major variable in climate change analyses (Feddema et al., 2005). In this context, accuracy descriptions can help the user to assess the uncertainties associated with incorporating land cover data into their model or to decide between land cover datasets, especially where there is a choice between data with different thematic or spatial characteristics (See & Fritz, 2006). Thus accuracy is one of the key aspects of any remotely sensed data product.

The most common approach for assessing thematic map accuracy is to compare the classified land cover with alternative but spatially and temporally coincident data, which are considered to be of higher accuracy. A sample of the land cover data created by the remote sensing analysis (here referred to as ‘classified’ data) is compared against some validation data (here referred to as ‘reference’ data). The resulting cross tabulation of classified data against reference data is commonly known as the error matrix, but in the literature is also called the confusion, contingency or validation matrix. The cross tabulation provided by the error matrix allows a number of standard reporting measures to be calculated including overall accuracy as well as user's and producer's accuracies (Congalton, 1991, Congalton and Green, 1999). These accuracy statistics provide measures of the reliability of the classified data and the degree (but not spatial extent) to which they are correct. Therefore the appropriateness of the information conveyed by the error matrix may be limited when specific local conditions vary, for example when non-stationary error distributions occur, or in the presence of heteroscedastic residual distributions — i.e. when sub-sets of the data vary from the overall trend (Stehman, 2000, Stehman, 2006).

There are two related limitations associated with accuracy assessments and error summaries calculated from the error matrix (McGwire & Fisher, 2001):

1)
The error matrix and the accuracy measures it supports provide no information about the spatial distribution of error;
2)
The overall accuracy measures derived from the error matrix may be inappropriate for sub-regions, where local error rates may be much larger or smaller than the global measures.

Overcoming such problems is important because many users of land cover data may be interested only in a particular subset of the data, either relating to a specific locale or to specific classes.

Some work in the remote sensing literature has explored the spatial distribution of different types of error and methods for reporting it. Campbell (1981) compared Landsat multispectral scanner images in the same growing season and found that misclassified pixels tended to be clustered. Congalton (1988) applied a Getis and Ord approach to analyse join count statistics to compare two datasets. Steele et al. (1998) used kriging to provide an optimal interpolation of map error. McGwire and Fisher (2001) recommended the use of Monte Carlo approaches to model the spatial distribution of errors. Some more recent research has examined the variability or non-stationarity of the distribution of errors. Riemann et al. (2010) describe a number of metrics for characterising the accuracy of spatial data that are dependent on reference data properties and Foody (2005) estimated local accuracy measures by interpolating the outputs of confusion matrices calculated at regular spaced intervals. Current validation and accuracy techniques in remote sensing have largely ignored the advances supported by such methods.

This research is in the spirit of Foody (2005). It uses Geographically Weighted Regression (GWR), a statistical method that explicitly deals with spatial non-stationarity (Brunsdon et al., 1996, Fotheringham et al., 2002), and a geographically weighted difference measure to analyse the spatial variations in the relationship between reference data and classified data for Boolean and fuzzy classes respectively. Geographically weighted approaches estimate spatially distributed measures of accuracy that are more informative than those provided by the confusion matrix. The paper proceeds as follows. Section 2 describes some of the scientific background to error matrices and their use in Boolean and fuzzy classifications. The methods and GWR are described in Section 3. Section 4 presents the results before a discussion of the issues arising from this research (Section 5) and some conclusions are drawn (Section 6).

Section snippets

Background

It is typical for the quality of spatial data such as land cover from remotely sensed imagery to be described using measures of thematic accuracy. The origins of the requirement of at least 85% thematic map accuracy can be traced back to Anderson (1971). Although the scientific basis for this accuracy level has been criticised (Congalton and Green, 1999, Pontius and Millones, 2011), it is historically related to land information being used for taxation assessments (Fisher, 1991, Fisher et al.,

Data and study area

The area of the present study is located in the north western part of Libya in Jifara Plain, around Tripoli. Satellite imagery from the Système Pour l'Observation de la Terre (SPOT) 5 sensor from 2009 was resampled to 30 m × 30 m as part of a wider study examining land cover changes using Landsat data from 1976, 1989 and 2005. It was classified into 6 classes: Urban, Woodland, Vegetation, Grazing Land and Bare areas and Water. Water was not a focus for this research. The class descriptions are

Results

The validation data were used to construct a standard error matrix (Table 3). The crisp, Boolean data allow a straightforward comparison between data classified from the remote sensing imagery and the reference data collected in the field, as well as user's and producer's accuracies to be calculated. Whilst it is evident that some classes are more reliably classified than others, the table provides no information about the spatial distribution of either overall error or errors for different

Discussion

The major contributions of this research relate to the development of 1) spatially distributed measures of accuracy using a kernel and distance weighting in geographically weighted accuracy measures; 2) a portmanteau accuracy measure for Boolean land cover data; and 3) a fuzzy difference measure to describe the accuracy of fuzzy classifications. The outputs of the Boolean analysis, using a portmanteau measure of accuracy, indicate the spatial variation in the extent to which the reference

Conclusions

This research uses geographically weighted approaches to describe the spatial variation in the accuracy of Boolean and fuzzy classifications of remotely sensed data. It proposes a portmanteau approach to describe Boolean land cover accuracy and fuzzy difference measures to describe the accuracy of fuzzy land cover. It addresses two long-standing gaps in the analysis and communication of accuracy and error land cover. First, by analysing the spatial distribution of errors it provides a better

Acknowledgements

The authors would like to thank the anonymous reviewers whose meticulous consideration of the earlier drafts and related comments have resulted in a much improved paper.

References (50)

C. Brunsdon et al.
Geographically weighted summary statistics — A framework for localized exploratory data analysis
Computers Environment and Urban Systems
(2002)
D.M. Chen et al.
The effect of spatial autocorrelation and class proportion on the accuracy measures from different sampling designs
ISPRS Journal of Photogrammetry and Remote Sensing
(2009)
R.G. Congalton
A review of assessing the accuracy of classifications of remotely sensed data
Remote Sensing of Environment
(1991)
I. Dronova et al.
Object-based analysis and change detection of major wetland cover types and their classification uncertainty during the low water period at Poyang Lake, China
Remote Sensing of Environment
(2011)
P.F. Fisher
Remote sensing of land cover classes as type 2 Fuzzy sets
Remote Sensing of Environment
(2010)
G.M. Foody
Status of land cover classification accuracy assessment
Remote Sensing of Environment
(2002)
G.M. Foody
Geographical weighting as a further refinement to regression modelling: An example focused on the NDVI–rainfall relationship
Remote Sensing of Environment
(2003)
P. Gonzalez et al.
Forest carbon densities and uncertainties from Lidar, QuickBird, and field measurements in California
Remote Sensing of Environment
(2010)
T. Phillips et al.
Modeling moulin distribution on Sermeq Avannarleq glacier using ASTER and WorldView imagery and Fuzzy set theory
Remote Sensing of Environment
(2011)
R. Riemann et al.
An effective assessment protocol for continuous geospatial datasets of forest characteristics using USFS Forest Inventory and Analysis (FIA) data
Remote Sensing of Environment
(2010)

D. Rocchini

While Boolean sets non-gently rip: A theoretical framework on fuzzy sets for mapping landscape patterns

Ecological Complexity

(2010)

B.M. Steele et al.

Estimation and mapping of misclassification probabilities for thematic land cover maps

Remote Sensing of Environment

(1998)

S.V. Stehman

Practical implications of design-based sampling inference for thematic map accuracy assessment

Remote Sensing of Environment

(2000)

J.R. Anderson

Land use classification schemes used in selected recent geographic applications of remote sensing

Photogrammetric Engineering

(1971)

C. Arnot et al.

Mapping the ecotone with Fuzzy sets

C.F. Brunsdon et al.

Geographically weighted regression — A method for exploring spatial non-stationarity

Geographical Analysis

(1996)

J. Campbell

Spatial correlation effects upon accuracy of supervised classification of land cover

Photogrammetric Engineering and Remote Sensing

(1981)

J.B. Campbell

Introduction to remote sensing

(2007)

A.J. Comber et al.

Using semantics to clarify the conceptual confusion between land cover and land use: The example of ‘forest’

Journal of Land use Science

(2008)

R.G. Congalton

Using spatial auto-correlation analysis to explore the errors in maps generated from remotely sensed data

Photogrammetric Engineering and Remote Sensing

(1988)

R.G. Congalton et al.

Assessing the accuracy of remotely sensed data: Principles and practices

(1999)

J.J. Feddema et al.

The importance of land-cover change in simulating future climates

Science

(2005)

P.F. Fisher

Modelling soil map‐unit inclusions by Monte Carlo simulation

International Journal of Geographical Information Systems

(1991)

P.F. Fisher

The pixel: A snare and a delusion

International Journal of Remote Sensing

(1997)

P.F. Fisher

Improved modelling of elevation error with geostatistics

GeoInformatica

(1998)

Cited by (171)

Using mixed-method analytical historical ecology to map land use and land cover change for ecocultural restoration in the Klamath River Basin (Northern California)
2024, Ecological Informatics
Ecocultural restoration involves the reciprocal repair of ecosystems and revitalization of cultural practices to enhance their mutual resilience to natural and anthropogenic disturbances and climate change stressors. Resilient ecocultural systems are adapted to retain structure and function in the face of disturbances that remain within historical ranges of severity. To assist in ecocultural restoration and management, understanding how a system has historically responded to different types of disturbances is therefore invaluable in understanding how social-ecological resilience can be maintained in the face of future stressors and disturbances. However, records of disturbances and ecocultural responses can be limited for certain landscapes and human communities. In this methods paper, we demonstrate a mixed-method process for integrating oral history, field-based knowledge, archival information, and historical and contemporary aerial images to gain insight into the changes on the Klamath River in Northern California from the 1940s through 2020. We georegistered historical imagery, quantified changes between land cover classes, and contextualized these classifications with qualitative assessments of changes in larger surrounding areas. By synthesizing these data sources with field measurements, mining and other land survey maps, timber management plans, fire and flood histories, and interviews with members of the Karuk Tribe, we were able to reconstruct the land use and land cover change histories at five sites. We noted that recovery of canopy cover from fire and logging practices was faster than for flood, which was faster than recovery from mining, consistent with the relative severity of likely soil disturbance. By combining different sources of information with complementary strengths, we were able to provide managers with site-specific information on recovery from different types of disturbance. Though this approach was labor-intensive, with emerging tools for supervised classification of high-resolution imagery, mixed-method analytical historical ecology could be applied more broadly, supporting ecocultural restoration on a larger scale.
Mapping planted forest age using LandTrendr algorithm and Landsat 5–8 on the Loess Plateau, China
2024, Agricultural and Forest Meteorology
Forest age is a key parameter for understanding the water and carbon cycles and carbon sequestration potential of planted forest ecosystems. However, estimating forest age on a large scale is difficult. This study aims to determine whether the distribution of forest age can be mapped over a large area using the LandTrendr (LT) algorithm. With the LT algorithm, the initial year of forestland (breakpoints) can be determined rather accurately through the yearly trajectory of the normalized burn ratio (NBR) index in the revegetated forest. The results show that the LT algorithm is a convenient, efficient, and reliable method for identifying forests age. Moreover, a comparison with ground truth data on the Chinese Loess Plateau (LP) revealed that the overall accuracy of the error confusion matrix was 89 %, with a root-mean-square error (RMSE) of 2.14 years. Using the LT algorithm, we revealed that the forestland on the Chinese LP in 2020 was dominated by planted forests over 30 years old, accounting for approximately 78.27 %. The forestland area increased by approximately 20,386 km² from 1990 to 2020, and the increase occurs primarily around the original forestland in the eastern and southern parts of the Chinese LP. This study provides an important parameter for assessing and quantifying biomass and carbon sequestration through afforestation on the Chinese LP.
Perceived barriers and advances in integrating earth observations with water resources modeling
2024, Remote Sensing Applications: Society and Environment
Advances in computing, collection, and sharing of Earth Observations (EOs) have significantly improved the potential for integrating EO and water resources models. Inadequate observational data for the systems simulated have been a persistent limitation in developing robust water resources models. Although various EO datasets have been available for decades, they have been under-utilized for water resources modeling. This can be due to sensor and product limitations, including spatial, spectral, and temporal resolutions, and the reluctance of the water resources community to adopt the state-of-art quickly. Motivated by the dual agenda of engaging the water resources community on various aspects of integrating EOs with water resources modeling and understanding the likely factors that limit a deeper integration of EOs in water resources management, we investigated the communities' perception of water resources modeling and EO integration. This paper summarizes the findings of a web-based survey conducted at the annual ASCE-EWRI (American Society of Civil Engineers-Environmental Water Resources Institute) International Water Congress in 2022.
The analysis of responses (n = 74) identified limited spatial resolution, atmospheric and cloud interference, and lack of in-situ validation data as the highest perceived barriers to integrating EOs with water resources modeling and management. Perceptions among different groups of participants and even within the groups were different. For example, the perceived barriers often differed between researchers and non-researchers (e.g., policymakers and practitioners). There were differences in perception among the remote-sensing and water resources researchers within the research community. Even among water resources communities, disparities existed between the perceptions of respondents who also identified as knowledgeable about remote sensing and those who didn’t. These observations highlighted the need to intentionally develop a convergent group and domain to integrate the disciplines involved and capitalize on the advancements that have improved the EO for water resources management.
Evolution of soil porosity in loess-palaeosol sequences of the Ebro Valley, NE Iberia
2023, Catena
Structure development and porosity evolution are closely related key processes in the formation of soils on loess. In order to better understand changes in soil porosity with time, four loess-palaeosol sequences (LPS) of the Ebro Valley were investigated using micromorphological and multiscalar images of soil thin sections combined with luminescence dating. Scanned and high-resolution mosaic images of thin sections using circular polarized light and backscattered electron scanning images were used to characterize the structural and textural porosity. The results were compared with physical and chemical analyses, and detailed micromorphological descriptions. The main results of our research revealed that 1) the change in loess particle packing, from enaulic to porphyric coarse/fine (c/f) related distribution, is the first process giving cohesion to the loess material and 2) that soil development during stabilization periods is associated with higher bioporosity. Gypsum accumulation is also a relevant process contributing to the increase of packing porosity within channels and chambers in the studied soils. Our results provide new insights into soil formation processes in Mediterranean loess and open new fields of research such as the possibility of using the quantification of bioporosity as an additional palaeoenvironmental marker.
Per-pixel accuracy as a weighting criterion for combining ensemble of extreme learning machine classifiers for satellite image classification
2023, International Journal of Applied Earth Observation and Geoinformation
Reliable classification of satellite images is essential for various applications, including land cover and crop (LCC) mapping. In recent years, ensemble classifiers have shown remarkable success in satellite image classification as they provide solutions to combine and integrate multiple classifiers. This research presented a novel satellite image classification method called PAELM. The PAELM algorithm builds a diverse set of extreme learning machine (ELM) classifiers and combines them using the pixel-based LCC accuracy values in such a way that, for a given pixel, the most highly accurate ELM classifier among the ensemble of ELMs assigns the LCC class of that pixel. We validated PAELM on six experimental sites with varying geographical environments in the United States and compared it with three advanced machine learning classifiers, namely support vector machine, conventional ELM, and extreme gradient boosting and one advanced ensemble classifier. Our results showed that PAELM improved the accuracy of LCC mapping in comparison to the benchmark classifiers. The LCC maps generated by PAELM had an averaged overall accuracy and aggregated F1 score of 0.811 and 0.804, respectively, while these values for the most accurate benchmark classifier were 0.787 and 0.781, respectively. The results also implied that all the classifiers were sensitive to scene heterogeneity and LCC class composition, with PAELM being the least sensitive classifier to these factors. Overall, our findings suggested that PAELM was a promising approach for accurate LCC mapping, demonstrating the practicality of spatial accuracy as a weighting factor in the integration of ensemble of classifiers.
An integrated approach of deep learning convolutional neural network and google earth engine for salt storm monitoring and mapping
2023, Atmospheric Pollution Research
This study aims to develop an integrated approach of deep learning convolutional neural network (DL-CNN) and Google Earth Engine (GEE) platform for salt storm modeling and monitoring. First, we selected several ST's predisposing factors, including Land Surface Temperature (LST), soil salinity, AOD, NDWI and NDVI to train models. We then collected 957 Ground Control Points (GCPs) from the study area, which were randomly divided into training (70%) and validation (30%) datasets. Finally, ReLu, Cross-Entropy, and Adam employed as activation function, loss function and optimizer, respectively. Our findings demonstrate the efficiency of an integrated DL-CNN and GEE for monitoring salt storms (Overall Accuracy (OA) = 0.93.02, 0.92.99, 0.93.88, and 0.92.01 for years 2002, 2010, 2015 and 2021, respectively). The results also show an increase in the frequency of salt storm in the study area from 2002 to 2021. Such approach is a promising step toward understanding, controlling, and managing salt storms and recommend salt storm spatial monitoring in other favored areas with similar environmental conditions. In addition, the results of this study provide critical insights into the environmental impacts of the Lake Urmia drought and its intensive environmental impacts on the human health and wellbeing of the residents.

View all citing articles on Scopus

View full text

Spatial analysis of remote sensing image classification accuracy

Abstract

Highlights

Introduction

Section snippets

Background

Data and study area

Results

Discussion

Conclusions

Acknowledgements

Computers Environment and Urban Systems

ISPRS Journal of Photogrammetry and Remote Sensing

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Ecological Complexity

Remote Sensing of Environment

Remote Sensing of Environment

Land use classification schemes used in selected recent geographic applications of remote sensing

Photogrammetric Engineering

Mapping the ecotone with Fuzzy sets

Geographically weighted regression — A method for exploring spatial non-stationarity

Geographical Analysis

Spatial correlation effects upon accuracy of supervised classification of land cover

Photogrammetric Engineering and Remote Sensing

Introduction to remote sensing

Using semantics to clarify the conceptual confusion between land cover and land use: The example of ‘forest’

Journal of Land use Science

Using spatial auto-correlation analysis to explore the errors in maps generated from remotely sensed data

Photogrammetric Engineering and Remote Sensing

Assessing the accuracy of remotely sensed data: Principles and practices

The importance of land-cover change in simulating future climates

Science

Modelling soil map‐unit inclusions by Monte Carlo simulation

International Journal of Geographical Information Systems

The pixel: A snare and a delusion

International Journal of Remote Sensing

Improved modelling of elevation error with geostatistics

GeoInformatica