Skip to main content
Erschienen in: GeoInformatica 1/2008

01.03.2008

Techniques for Computing Fitness of Use (FoU) for Time Series Datasets with Applications in the Geospatial Domain

verfasst von: Lei Fu, Leen-Kiat Soh, Ashok Samal

Erschienen in: GeoInformatica | Ausgabe 1/2008

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Time series data are widely used in many applications including critical decision support systems. The goodness of the dataset, called the Fitness of Use (FoU), used in the analysis has direct bearing on the quality of the information and knowledge generated and hence on the quality of the decisions based on them. Unlike traditional quality of data which is independent of the application in which it is used, FoU is a function of the application. As the use of geospatial time series datasets increase in many critical applications, it is important to develop formal methodologies to compute their FoU and propagate it to the derived information, knowledge and decisions. In this paper we propose a formal framework to compute the FoU of time series datasets. We present three different techniques using the Dempster–Shafer belief theory framework as the foundation. These three approaches investigate the FoU by focusing on three aspects of data: data attributes, data stability, and impact of gap periods, respectively. The effectiveness of each approach is shown using an application in hydrological datasets that measure streamflow. While we use hydrological information analysis as our application domain in this research, the techniques can be used in many other domains as well.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat J.L. Goodall, D.R. Maidment, and J. Sorenson. “Representation of spatial and temporal data,” in ArcGIS, AWRA GIS and Water Resources III Conference, Nashville, TN, 2004. J.L. Goodall, D.R. Maidment, and J. Sorenson. “Representation of spatial and temporal data,” in ArcGIS, AWRA GIS and Water Resources III Conference, Nashville, TN, 2004.
3.
Zurück zum Zitat X. Yao. “Research issues in spatio-temporal data mining,” in University Consortium for Geographic Information Science (UCGIS) Workshop on Geospatial Visualization and Knowledge Discovery. Lansdowne, Virginia (White Paper), Nov. 18–20, 2003. X. Yao. “Research issues in spatio-temporal data mining,” in University Consortium for Geographic Information Science (UCGIS) Workshop on Geospatial Visualization and Knowledge Discovery. Lansdowne, Virginia (White Paper), Nov. 18–20, 2003.
4.
Zurück zum Zitat Meta Group. Data Warehouse Scorecard. Meta Group, 1999. Meta Group. Data Warehouse Scorecard. Meta Group, 1999.
5.
Zurück zum Zitat U. Grimmer and H. Hinrichs. “A methodological approach to data quality management supported by data mining,” in Proc. of the 6th International Conference on Information Quality (IQ 2001), 2001. U. Grimmer and H. Hinrichs. “A methodological approach to data quality management supported by data mining,” in Proc. of the 6th International Conference on Information Quality (IQ 2001), 2001.
6.
Zurück zum Zitat G. Shafer. A Mathematical Theory of Evidence. Princeton University Press: Princeton, NJ, 1976. G. Shafer. A Mathematical Theory of Evidence. Princeton University Press: Princeton, NJ, 1976.
8.
Zurück zum Zitat A. Gelman. Bayesian Data Analysis. CRC Press: Boca Raton, FL, 2004. A. Gelman. Bayesian Data Analysis. CRC Press: Boca Raton, FL, 2004.
9.
Zurück zum Zitat Y.W. Lee and D.M. Strong. “Knowing—why about data processes and data quality,” Journal of Management Information Systems, Vol. 20(3):13–39, 2003–2004, winter. Y.W. Lee and D.M. Strong. “Knowing—why about data processes and data quality,” Journal of Management Information Systems, Vol. 20(3):13–39, 2003–2004, winter.
10.
Zurück zum Zitat R.Y. Yang, M.P. Ready, and H.B. Kon. “Toward quality data: an attribute-based approach,” Decision Support Systems, Vol. 12:349–372, 1995. R.Y. Yang, M.P. Ready, and H.B. Kon. “Toward quality data: an attribute-based approach,” Decision Support Systems, Vol. 12:349–372, 1995.
11.
Zurück zum Zitat L.L. Pipino, Y.W. Lee, and R.Y. Wang. “Data quality assessment,” Communications of ACM, Vol. 45:211–218, 2002, April. L.L. Pipino, Y.W. Lee, and R.Y. Wang. “Data quality assessment,” Communications of ACM, Vol. 45:211–218, 2002, April.
12.
Zurück zum Zitat D.P. Ballou and H.L. Pazer. “Modeling data and process quality in multi-input, multi-output information system,” Management Science, Vol. 31(2):150–162, 1985.CrossRef D.P. Ballou and H.L. Pazer. “Modeling data and process quality in multi-input, multi-output information system,” Management Science, Vol. 31(2):150–162, 1985.CrossRef
13.
Zurück zum Zitat K. Huang, Y.W. Lee, and R.Y. Wang. Quality Information and Knowledge. Prentice Hall: Upper Saddle River, NJ, 1999. K. Huang, Y.W. Lee, and R.Y. Wang. Quality Information and Knowledge. Prentice Hall: Upper Saddle River, NJ, 1999.
14.
Zurück zum Zitat A.X. Zhu. “Research issues on uncertainty in geographic data and GIS-based analysis,” in Research Agenda for Geographic Information Science, pp. 197–223, 2004. A.X. Zhu. “Research issues on uncertainty in geographic data and GIS-based analysis,” in Research Agenda for Geographic Information Science, pp. 197–223, 2004.
15.
Zurück zum Zitat M.P. Lynch and A.J. Saalfeld. “Conflation: Automated map compilation—a video game approach,” in Proc. of Auto-Carto 7, Falls Church, VA, 1985. M.P. Lynch and A.J. Saalfeld. “Conflation: Automated map compilation—a video game approach,” in Proc. of Auto-Carto 7, Falls Church, VA, 1985.
16.
Zurück zum Zitat H. Foley, F. Petty, M. Cobb, and K.B. Shaw. “Utilization of an expert system for the analysis of semantic characteristics for improved conflation in geographic information system,” in Proc. of the 10th International Conference on Industrial and Engineering Applications of AI, pp. 267–275, Atlanta, GA, 1997. H. Foley, F. Petty, M. Cobb, and K.B. Shaw. “Utilization of an expert system for the analysis of semantic characteristics for improved conflation in geographic information system,” in Proc. of the 10th International Conference on Industrial and Engineering Applications of AI, pp. 267–275, Atlanta, GA, 1997.
17.
Zurück zum Zitat NCGIA. A research agenda for geographic information and analysis. Technical Report 92-7, 1992. NCGIA. A research agenda for geographic information and analysis. Technical Report 92-7, 1992.
18.
Zurück zum Zitat M.F. Goodchild and S. Gopal. Accuracy of Spatial Databases. Taylor and Francis: London, 1990. M.F. Goodchild and S. Gopal. Accuracy of Spatial Databases. Taylor and Francis: London, 1990.
19.
Zurück zum Zitat M. Blakemore. “Generalization and error in spatial databases,” Cartographica, Vol. 21:131–139, 1983. M. Blakemore. “Generalization and error in spatial databases,” Cartographica, Vol. 21:131–139, 1983.
20.
Zurück zum Zitat N.R. Chrisman and M.K. Lester. “A diagnostic test for error in categorical maps, Auto-Carto 10,” in Technical Papers of the 1991 ACSM-ASPRS Annual Convention, Vol. 6, pp. 330–348, Baltimore, MD, 1991. N.R. Chrisman and M.K. Lester. “A diagnostic test for error in categorical maps, Auto-Carto 10,” in Technical Papers of the 1991 ACSM-ASPRS Annual Convention, Vol. 6, pp. 330–348, Baltimore, MD, 1991.
21.
Zurück zum Zitat P.F. Fisher. “Models of uncertainty in spatial data,” in P.A. Longley, M.F. Goodchild, D.J. Maguire, and D.W. Rhind (Eds.), Geographical Information System: Principles and Technical Issues, 191–205, Wiley: New York, 1999. P.F. Fisher. “Models of uncertainty in spatial data,” in P.A. Longley, M.F. Goodchild, D.J. Maguire, and D.W. Rhind (Eds.), Geographical Information System: Principles and Technical Issues, 191–205, Wiley: New York, 1999.
22.
Zurück zum Zitat A.X. Zhu. “Measuring uncertainty in class assignment for natural resource maps using a similarity model,” Photogrammetric Engineering and Remote Sensing, Vol. 63:1195–1202, 1997. A.X. Zhu. “Measuring uncertainty in class assignment for natural resource maps using a similarity model,” Photogrammetric Engineering and Remote Sensing, Vol. 63:1195–1202, 1997.
23.
Zurück zum Zitat S.C. Guptill and J.L. Morrison. Elements of Spatial Data Quality. Elsevier: Tarrytown, NY, 1995. S.C. Guptill and J.L. Morrison. Elements of Spatial Data Quality. Elsevier: Tarrytown, NY, 1995.
25.
Zurück zum Zitat J. Hipp, U. Güntzer, and U. Grimmer. “Data quality mining — making a virtue of necessity,” in Proc. of the 6th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2001), pp. 52–57, Santa Barbara, California, 2001. J. Hipp, U. Güntzer, and U. Grimmer. “Data quality mining — making a virtue of necessity,” in Proc. of the 6th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2001), pp. 52–57, Santa Barbara, California, 2001.
26.
Zurück zum Zitat R. Srikant and R. Agrawal. “Mining generalized association rules,” in Proc. of 21st VLDC Conference, 1995. R. Srikant and R. Agrawal. “Mining generalized association rules,” in Proc. of 21st VLDC Conference, 1995.
27.
Zurück zum Zitat D. Luebbers, U. Grimmer, and M. Jarke. “Systematic development of data mining-based data quality tools,” in Proc. of the 29th VLDB Conference, Berlin, Germany, 2003. D. Luebbers, U. Grimmer, and M. Jarke. “Systematic development of data mining-based data quality tools,” in Proc. of the 29th VLDB Conference, Berlin, Germany, 2003.
28.
Zurück zum Zitat J. Theodore and D. Tamraparni. “Comparing massive high-dimensional data sets,” in Proc. of ACM SIGKDD Conference, 1998. J. Theodore and D. Tamraparni. “Comparing massive high-dimensional data sets,” in Proc. of ACM SIGKDD Conference, 1998.
29.
Zurück zum Zitat R.Y. Liu and K. Singh. “A quality index based on data depth and multivariate rank tests,” Journal of the American Statistical Association, Vol. 88(421):252–268 1993. R.Y. Liu and K. Singh. “A quality index based on data depth and multivariate rank tests,” Journal of the American Statistical Association, Vol. 88(421):252–268 1993.
30.
Zurück zum Zitat P. Vassiliadis, A. Vagena, S. Skiadopoulos, N. Karayannidis, and T. Sellis. “Arktos: a tool for data cleaning and transformation in data warehouse environments,” IEEE Data Engineering Bulletin, Vol. 23(4):42–47, 2000. P. Vassiliadis, A. Vagena, S. Skiadopoulos, N. Karayannidis, and T. Sellis. “Arktos: a tool for data cleaning and transformation in data warehouse environments,” IEEE Data Engineering Bulletin, Vol. 23(4):42–47, 2000.
31.
Zurück zum Zitat R.Y. Wang, H.B. Kon, and S.E. Madnick. “Data quality requirements analysis and modeling,” in Proc. of Ninth International Conference on Data Engineering, Vienna, Austria, 1993 (April). R.Y. Wang, H.B. Kon, and S.E. Madnick. “Data quality requirements analysis and modeling,” in Proc. of Ninth International Conference on Data Engineering, Vienna, Austria, 1993 (April).
32.
Zurück zum Zitat B.K. Kahn, D.M. Strong, and R.Y. Wang. “Information quality benchmark: product and service performance,” Communications of the ACM, Vol. 45(4):184–192, 2002.CrossRef B.K. Kahn, D.M. Strong, and R.Y. Wang. “Information quality benchmark: product and service performance,” Communications of the ACM, Vol. 45(4):184–192, 2002.CrossRef
33.
Zurück zum Zitat Y.W. Lee, D.M. Strong, B.K. Kahn, and R.Y. Wang. “AIMQ: A methodology for information quality assessment,” Information and Management, Vol. 40(2):133–146, 2002.CrossRef Y.W. Lee, D.M. Strong, B.K. Kahn, and R.Y. Wang. “AIMQ: A methodology for information quality assessment,” Information and Management, Vol. 40(2):133–146, 2002.CrossRef
34.
Zurück zum Zitat G. Shankaranarayanan and M. Ziad. “Managing data quality in dynamic decision environment: An information product approach,” Journal of Data Management, Vol. 14(4): 14–32, 2003. G. Shankaranarayanan and M. Ziad. “Managing data quality in dynamic decision environment: An information product approach,” Journal of Data Management, Vol. 14(4): 14–32, 2003.
35.
Zurück zum Zitat J.R. Eastman. “Uncertainty management in GIS: Decision support tools for effective use of spatial data, Chapter 18,” in C. Hunsaker, M. Goodchild, M. Friedl, and E. Case (Eds.), Spatial Uncertainty in Ecology: Implications for Remote Sensing and GIS Applications, 379–390, Springer: New York, 2001. J.R. Eastman. “Uncertainty management in GIS: Decision support tools for effective use of spatial data, Chapter 18,” in C. Hunsaker, M. Goodchild, M. Friedl, and E. Case (Eds.), Spatial Uncertainty in Ecology: Implications for Remote Sensing and GIS Applications, 379–390, Springer: New York, 2001.
38.
Zurück zum Zitat F. Cremer, E. den Breejen, and K. Schutte. “Sensor data fusion for antipersonnel land mine detection,” in Proc. of EuroFusion98, pp. 55–60, 1998, October. F. Cremer, E. den Breejen, and K. Schutte. “Sensor data fusion for antipersonnel land mine detection,” in Proc. of EuroFusion98, pp. 55–60, 1998, October.
39.
Zurück zum Zitat J. Braun. “Dempster–Shafer theory and Bayesian reasoning in multisensor data fusion, sensor fusion: architectures, algorithms and applications IV,” in Proc. of SPIE 4051, pp. 255–266, 2000. J. Braun. “Dempster–Shafer theory and Bayesian reasoning in multisensor data fusion, sensor fusion: architectures, algorithms and applications IV,” in Proc. of SPIE 4051, pp. 255–266, 2000.
40.
Zurück zum Zitat G. Mihaila, L. Raschid, and M.E. Vidal. “Querying, “quality of data” metadata,” in Proc. of the Third IEEE Meta-data Conference, Bethesda, Maryland, 1999, April. G. Mihaila, L. Raschid, and M.E. Vidal. “Querying, “quality of data” metadata,” in Proc. of the Third IEEE Meta-data Conference, Bethesda, Maryland, 1999, April.
41.
Zurück zum Zitat J.C. Giarratano and G.D. Riley. “Expert systems: principles and programming,” in Principles and Programming, 4th edn. Course Technology, 2004. J.C. Giarratano and G.D. Riley. “Expert systems: principles and programming,” in Principles and Programming, 4th edn. Course Technology, 2004.
42.
Zurück zum Zitat SAS Institute. SAS/ETS User’s Guide, Version 8. SAS Publishing: Cary, NC, 1999. SAS Institute. SAS/ETS User’s Guide, Version 8. SAS Publishing: Cary, NC, 1999.
43.
Zurück zum Zitat L.-K. Soh, A. Samal, and W. Waltman. Watershed study: correlation analysis on seven watersheds in Nebraska. Technical Report, Department of Computer Science and Engineering, University of Nebraska, 2003. L.-K. Soh, A. Samal, and W. Waltman. Watershed study: correlation analysis on seven watersheds in Nebraska. Technical Report, Department of Computer Science and Engineering, University of Nebraska, 2003.
44.
Zurück zum Zitat K.L. McGraw and M.R. Seale. “Knowledge elicitation with multiple experts: considerations and techniques,” Artificial Intelligence Review, Vol. 2(1):31–44, 2004.CrossRef K.L. McGraw and M.R. Seale. “Knowledge elicitation with multiple experts: considerations and techniques,” Artificial Intelligence Review, Vol. 2(1):31–44, 2004.CrossRef
Metadaten
Titel
Techniques for Computing Fitness of Use (FoU) for Time Series Datasets with Applications in the Geospatial Domain
verfasst von
Lei Fu
Leen-Kiat Soh
Ashok Samal
Publikationsdatum
01.03.2008
Verlag
Springer US
Erschienen in
GeoInformatica / Ausgabe 1/2008
Print ISSN: 1384-6175
Elektronische ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-007-0025-0

Weitere Artikel der Ausgabe 1/2008

GeoInformatica 1/2008 Zur Ausgabe