Skip to main content
Erschienen in: GeoInformatica 4/2009

01.12.2009

Web data retrieval: solving spatial range queries using k-nearest neighbor searches

verfasst von: Wan D. Bae, Shayma Alkobaisi, Seon Ho Kim, Sada Narayanappa, Cyrus Shahabi

Erschienen in: GeoInformatica | Ausgabe 4/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

As Geographic Information Systems (GIS) technologies have evolved, more and more GIS applications and geospatial data are available on the web. Spatial objects in a given query range can be retrieved using spatial range query − one of the most widely used query types in GIS and spatial databases. However, it can be challenging to retrieve these data from various web applications where access to the data is only possible through restrictive web interfaces that support certain types of queries. A typical scenario is the existence of numerous business web sites that provide their branch locations through a limited “nearest location” web interface. For example, a chain restaurant’s web site such as McDonalds can be queried to find some of the closest locations of its branches to the user’s home address. However, even though the site has the location data of all restaurants in, for example, the state of California, it is difficult to retrieve the entire data set efficiently due to its restrictive web interface. Considering that k-Nearest Neighbor (k-NN) search is one of the most popular web interfaces in accessing spatial data on the web, this paper investigates the problem of retrieving geospatial data from the web for a given spatial range query using only k-NN searches. Based on the classification of k-NN interfaces on the web, we propose a set of range query algorithms to completely cover the rectangular shape of the query range (completeness) while minimizing the number of k-NN searches as possible (efficiency). We evaluated the efficiency of the proposed algorithms through statistical analysis and empirical experiments using both synthetic and real data sets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Actually, overestimation introduce a slight overhead in each cost function but it is way lower than the overhead caused by underestimation.
 
2
The terms, CDT and DCDT, are used for both the corresponding triangulation algorithms and the data structures that support these algorithms in this chapter.
 
Literatur
1.
Zurück zum Zitat Barish G, Chen Y, Dipasquo D, Knoblock CA, Minton S, Muslea I, Shahabi C (2000) Theaterloc: using information integration technology to rapidly build virtual application. In: Proceedings of international conf. on data engineering (ICDE), 28 February–3 March 2000, San Diego, pp 681–682 Barish G, Chen Y, Dipasquo D, Knoblock CA, Minton S, Muslea I, Shahabi C (2000) Theaterloc: using information integration technology to rapidly build virtual application. In: Proceedings of international conf. on data engineering (ICDE), 28 February–3 March 2000, San Diego, pp 681–682
2.
Zurück zum Zitat Bae WD, Alkobaisi S, Kim SH, Narayanappa S, Shahabi C (2007) Supporting range queries on web data using k-nearest neighbor search. In: Proceedings of the 7th international symposium on web and wireless GIS (W2GIS 2007), 28–29 November 2007, Cardiff, pp 61–75 Bae WD, Alkobaisi S, Kim SH, Narayanappa S, Shahabi C (2007) Supporting range queries on web data using k-nearest neighbor search. In: Proceedings of the 7th international symposium on web and wireless GIS (W2GIS 2007), 28–29 November 2007, Cardiff, pp 61–75
3.
Zurück zum Zitat Bae WD, Alkobaisi S, Kim SH, Narayanappa S, Shahabi C (2007) Supporting range queries on web data using k-nearest neighbor search. Technical report DU-CS-08-01, University of Denver Bae WD, Alkobaisi S, Kim SH, Narayanappa S, Shahabi C (2007) Supporting range queries on web data using k-nearest neighbor search. Technical report DU-CS-08-01, University of Denver
4.
Zurück zum Zitat Byers S, Freire J, Silva C (2001) Efficient acquisition of web data through restricted query interface. In: Poster proceedings of the world wide web conference (WWW10), Hong Kong, 1–5 May 2001, pp 184–185 Byers S, Freire J, Silva C (2001) Efficient acquisition of web data through restricted query interface. In: Poster proceedings of the world wide web conference (WWW10), Hong Kong, 1–5 May 2001, pp 184–185
5.
Zurück zum Zitat Chew LP (1989) Constrained Delaunay triangulations. Algorithmica 4(1):97–108CrossRef Chew LP (1989) Constrained Delaunay triangulations. Algorithmica 4(1):97–108CrossRef
6.
Zurück zum Zitat Dickerson M, Drysdale R, Sack J (1992) Simple algorithms for enumerating interpoint distances and finding k nearest neighbors. Int J Comput Geom Appl 2:221–239CrossRef Dickerson M, Drysdale R, Sack J (1992) Simple algorithms for enumerating interpoint distances and finding k nearest neighbors. Int J Comput Geom Appl 2:221–239CrossRef
7.
Zurück zum Zitat Eppstein D, Erickson J (1994) Interated nearest neighbors and finding minimal polytypes. Discrete Comput Geom 11:321–350CrossRef Eppstein D, Erickson J (1994) Interated nearest neighbors and finding minimal polytypes. Discrete Comput Geom 11:321–350CrossRef
8.
Zurück zum Zitat Gaede V, Gounter O (1998) Multidimensional access methods. ACM Comput Surv 30(2):170–231CrossRef Gaede V, Gounter O (1998) Multidimensional access methods. ACM Comput Surv 30(2):170–231CrossRef
9.
Zurück zum Zitat Hieu LQ (2005) Integration of web data sources: a survey of existing problems. In: Proceedings of the 17th GI-workshop on the foundations of databases (GvD), Wörlitz, 17–20 May 2005 Hieu LQ (2005) Integration of web data sources: a survey of existing problems. In: Proceedings of the 17th GI-workshop on the foundations of databases (GvD), Wörlitz, 17–20 May 2005
10.
Zurück zum Zitat Liu D, Lim E, Ng W (2002) Efficient k nearest neighbor queries on remote spatial databases using range estimation. In: Proceedings of international conf. on scientific and statistical databases management (SSDMB), Edinburgh, 24–26 July 2002, pp 121–130 Liu D, Lim E, Ng W (2002) Efficient k nearest neighbor queries on remote spatial databases using range estimation. In: Proceedings of international conf. on scientific and statistical databases management (SSDMB), Edinburgh, 24–26 July 2002, pp 121–130
11.
Zurück zum Zitat Mergerian S, Koushanfar F (2005) Worst and best-case coverage in sensor networks. IEEE Trans Mob Comput 4(1):84–92CrossRef Mergerian S, Koushanfar F (2005) Worst and best-case coverage in sensor networks. IEEE Trans Mob Comput 4(1):84–92CrossRef
12.
Zurück zum Zitat Nie Z, Kambhampati S, Nambiar U (2005) Effectively mining and using coverage and overlap statistics for data integration. IEEE Trans Knowl Data Eng 17(5):638–651, MayCrossRef Nie Z, Kambhampati S, Nambiar U (2005) Effectively mining and using coverage and overlap statistics for data integration. IEEE Trans Knowl Data Eng 17(5):638–651, MayCrossRef
13.
Zurück zum Zitat Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: Proceedings of ACM SIGMOD, San Jose, May 1995, pp 71–79 Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: Proceedings of ACM SIGMOD, San Jose, May 1995, pp 71–79
14.
Zurück zum Zitat Samet H (1985) Data structures for quadtree approximation and compression. Commun ACM 28(9):973–993, SeptemberCrossRef Samet H (1985) Data structures for quadtree approximation and compression. Commun ACM 28(9):973–993, SeptemberCrossRef
15.
Zurück zum Zitat Sharifzadeh M, Shahabi C (2006) Utilizing voronoi cells of location data streams for accurate computation of aggregate functions in sensor networks. GeoInformatica 10(1):9–36CrossRef Sharifzadeh M, Shahabi C (2006) Utilizing voronoi cells of location data streams for accurate computation of aggregate functions in sensor networks. GeoInformatica 10(1):9–36CrossRef
16.
Zurück zum Zitat Song Z, Roussonpoulos N (2001) K-nearest neighbor search for moving query point. In: Proceedings of international symposium on spatial and temporal databases (SSTD), Redondo Beach, 12–15 July 2001, pp 79–96 Song Z, Roussonpoulos N (2001) K-nearest neighbor search for moving query point. In: Proceedings of international symposium on spatial and temporal databases (SSTD), Redondo Beach, 12–15 July 2001, pp 79–96
17.
Zurück zum Zitat Tao U, Zhang U, Papadias D, Mamoulis N (2004) An efficient cost model for optimization of nearest neighbor search in low and medium dimensional spaces. IEEE Trans Knowl Data Eng 16(10):1169–1184, OctoberCrossRef Tao U, Zhang U, Papadias D, Mamoulis N (2004) An efficient cost model for optimization of nearest neighbor search in low and medium dimensional spaces. IEEE Trans Knowl Data Eng 16(10):1169–1184, OctoberCrossRef
19.
Zurück zum Zitat Wang G, Cao G, Porta TL (2003) Movement-assisted sensor deployment. In: Proceedings of IEEE INFOCOM, San Francisco, 30 March–3 April 2003, pp 2469–2479 Wang G, Cao G, Porta TL (2003) Movement-assisted sensor deployment. In: Proceedings of IEEE INFOCOM, San Francisco, 30 March–3 April 2003, pp 2469–2479
20.
Zurück zum Zitat Wang S, Armstrong MP (2003) A quadtree approach to domain decomposition for spatial interpolation in grid computing environments. Parallel Comput 29(3):1481–1504, AprilCrossRef Wang S, Armstrong MP (2003) A quadtree approach to domain decomposition for spatial interpolation in grid computing environments. Parallel Comput 29(3):1481–1504, AprilCrossRef
21.
Zurück zum Zitat Wu C, Lee K, Chung Y (2006) A Delaunay triangulation based method for wireless sensor network deployment. In: Proceedings of ICPADS, Minneapolis, July 2006, pp 253–260 Wu C, Lee K, Chung Y (2006) A Delaunay triangulation based method for wireless sensor network deployment. In: Proceedings of ICPADS, Minneapolis, July 2006, pp 253–260
22.
Zurück zum Zitat Yerneni R, Li C, Garcia-Molina H, Ullman J (1999) Computing capabilities of mediators. In: Proceedings of SIGMOD, Philadelphia, 1–3 June 1999, pp 443–454 Yerneni R, Li C, Garcia-Molina H, Ullman J (1999) Computing capabilities of mediators. In: Proceedings of SIGMOD, Philadelphia, 1–3 June 1999, pp 443–454
Metadaten
Titel
Web data retrieval: solving spatial range queries using k-nearest neighbor searches
verfasst von
Wan D. Bae
Shayma Alkobaisi
Seon Ho Kim
Sada Narayanappa
Cyrus Shahabi
Publikationsdatum
01.12.2009
Verlag
Springer US
Erschienen in
GeoInformatica / Ausgabe 4/2009
Print ISSN: 1384-6175
Elektronische ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-008-0055-2

Weitere Artikel der Ausgabe 4/2009

GeoInformatica 4/2009 Zur Ausgabe