Skip to main content
Top
Published in: GeoInformatica 4/2018

15-10-2018

Diverse nearest neighbors queries using linear skylines

Authors: Camila F. Costa, Mario A. Nascimento, Matthias Schubert

Published in: GeoInformatica | Issue 4/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

k-nearest neighbor (k-NN) queries are well-known and widely used in a plethora of applications. However, in the original definition of k-NN queries there is no concern regarding diversity of the answer set with respect to the user’s interests. For instance, travelers may be looking for touristic sites that are close to where they are, but that would also lead them to see different parts of the city. Likewise, if one is looking for restaurants close by, it may be more interesting to learn about restaurants of different categories or ethnicities which are nonetheless relatively close. The interesting novel aspect of this type of query is that there are two competing criteria to be optimized: closeness and diversity. We propose two approaches that leverage the notion of linear skyline queries in order to find the k diverse nearest neighbors within a radius r from a given query point, or (k, r)-DNNs for short. Our proposed approaches return a relatively small set containing all optimal solutions for any linear combination of the weights a user could give to the two competing criteria, and we consider three different notions of diversity: spatial, categorical and angular. Our experiments, varying a number of parameters and exploring synthetic and real datasets, in both Euclidean space and road networks, respectively, show that our approaches are several orders of magnitude faster than a straightforward approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Since it finds all possible sets of size k regardless of the diversity considered, BF’s processing time does not vary with the type of diversity; hence we omit the results for those.
 
Literature
1.
go back to reference Abbar S, Amer-Yahia S, Indyk P, Mahabadi S, Varadarajan KR (2013) Diverse near neighbor problem. In: Proceedings of the 29th Symposium on Computational Geometry, pp 207–214 Abbar S, Amer-Yahia S, Indyk P, Mahabadi S, Varadarajan KR (2013) Diverse near neighbor problem. In: Proceedings of the 29th Symposium on Computational Geometry, pp 207–214
3.
go back to reference Borzsony S, Kossmann D, Stocker K (2001) The skyline operator. In: Proceedings of 17th International Conference on Data Engineering, pp 421–430 Borzsony S, Kossmann D, Stocker K (2001) The skyline operator. In: Proceedings of 17th International Conference on Data Engineering, pp 421–430
4.
go back to reference Carbonell J, Goldstein J (1998) The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 335–336 Carbonell J, Goldstein J (1998) The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 335–336
5.
go back to reference Carterette B (2011) An analysis of np-completeness in novelty and diversity ranking. Inf Retr 14:89–106CrossRef Carterette B (2011) An analysis of np-completeness in novelty and diversity ranking. Inf Retr 14:89–106CrossRef
6.
go back to reference Clarke CL, Kolla M, Cormack GV, Vechtomova O, Ashkan A, Büttcher S, MacKinnon I (2008) Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 659–666 Clarke CL, Kolla M, Cormack GV, Vechtomova O, Ashkan A, Büttcher S, MacKinnon I (2008) Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 659–666
7.
go back to reference Costa CF, Nascimento MA (2017) Towards spatially- and category-wise k-diverse nearest neighbors queries. In: International Symposium on Spatial and Temporal Databases, pp 163–181 Costa CF, Nascimento MA (2017) Towards spatially- and category-wise k-diverse nearest neighbors queries. In: International Symposium on Spatial and Temporal Databases, pp 163–181
8.
go back to reference Gu Y, Liu G, Qi J, Xu H, Yu G, Zhang R (2016) The moving k diversified nearest neighbor query. IEEE Trans Knowl Data Eng 28:2778–2792CrossRef Gu Y, Liu G, Qi J, Xu H, Yu G, Zhang R (2016) The moving k diversified nearest neighbor query. IEEE Trans Knowl Data Eng 28:2778–2792CrossRef
10.
go back to reference Huang Z et al (2011) A clustering based approach for skyline diversity. Expert Syst Appl 38:7984–7993CrossRef Huang Z et al (2011) A clustering based approach for skyline diversity. Expert Syst Appl 38:7984–7993CrossRef
11.
go back to reference Jain A, Sarda P, Haritsa JR (2004) Providing diversity in k-nearest neighbor query results. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp 404–413CrossRef Jain A, Sarda P, Haritsa JR (2004) Providing diversity in k-nearest neighbor query results. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp 404–413CrossRef
12.
go back to reference Kucuktunc O, Ferhatosmanoglu H (2013) λ-diverse nearest neighbors browsing for multidimensional data. IEEE Trans Knowl Data Eng 25:481–493CrossRef Kucuktunc O, Ferhatosmanoglu H (2013) λ-diverse nearest neighbors browsing for multidimensional data. IEEE Trans Knowl Data Eng 25:481–493CrossRef
13.
go back to reference Lee K C, Lee W C, Leong H V (2010) Nearest surrounder queries. IEEE Trans Knowl Data Eng 22:1444–1458CrossRef Lee K C, Lee W C, Leong H V (2010) Nearest surrounder queries. IEEE Trans Knowl Data Eng 22:1444–1458CrossRef
14.
go back to reference Rafiei D, Bharat K, Shukla A (2010) . In: Proceedings of the 19th International Conference on World Wide Web, pp 781–790 Rafiei D, Bharat K, Shukla A (2010) . In: Proceedings of the 19th International Conference on World Wide Web, pp 781–790
15.
go back to reference Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: ACM SIGMOD Record, pp 71–79 Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: ACM SIGMOD Record, pp 71–79
16.
go back to reference Shekelyan M, Jossé G, Schubert M, Kriegel HP (2014) Linear path skyline computation in bicriteria networks. In: International Conference on Database Systems for Advanced Applications, pp 173–187CrossRef Shekelyan M, Jossé G, Schubert M, Kriegel HP (2014) Linear path skyline computation in bicriteria networks. In: International Conference on Database Systems for Advanced Applications, pp 173–187CrossRef
17.
go back to reference Tao Y (2009) Diversity in skylines. IEEE Data Eng Bull 32:65–72 Tao Y (2009) Diversity in skylines. IEEE Data Eng Bull 32:65–72
18.
go back to reference Valkanas G, Papadopoulos AN, Gunopulos D (2013) Skydiver: a framework for skyline diversification. In: Proceedings of the 16th International Conference on Extending Database Technology, pp 406–417 Valkanas G, Papadopoulos AN, Gunopulos D (2013) Skydiver: a framework for skyline diversification. In: Proceedings of the 16th International Conference on Extending Database Technology, pp 406–417
19.
go back to reference Vieira MR, Razente HL, Barioni MC, Hadjieleftheriou M, Srivastava D, Traina C, Tsotras VJ (2011) On query result diversification. In: IEEE 27th International Conference on Data Engineering, pp 1163–1174 Vieira MR, Razente HL, Barioni MC, Hadjieleftheriou M, Srivastava D, Traina C, Tsotras VJ (2011) On query result diversification. In: IEEE 27th International Conference on Data Engineering, pp 1163–1174
20.
go back to reference Yu C, Lakshmanan L, Amer-Yahia S (2009) It takes variety to make a world: Diversification in recommender systems. In: Proceedings of the 12th International Conference on Extending Database Technology, pp 368–378 Yu C, Lakshmanan L, Amer-Yahia S (2009) It takes variety to make a world: Diversification in recommender systems. In: Proceedings of the 12th International Conference on Extending Database Technology, pp 368–378
21.
go back to reference Zhang C, Zhang Y, Zhang W, Lin X, Cheema MA, Wang X (2014) Diversified spatial keyword search on road networks. In: Proceedings of the 17th International Conference on Extending Database Technology, pp 367–378 Zhang C, Zhang Y, Zhang W, Lin X, Cheema MA, Wang X (2014) Diversified spatial keyword search on road networks. In: Proceedings of the 17th International Conference on Extending Database Technology, pp 367–378
Metadata
Title
Diverse nearest neighbors queries using linear skylines
Authors
Camila F. Costa
Mario A. Nascimento
Matthias Schubert
Publication date
15-10-2018
Publisher
Springer US
Published in
GeoInformatica / Issue 4/2018
Print ISSN: 1384-6175
Electronic ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-018-0332-7

Other articles of this Issue 4/2018

GeoInformatica 4/2018 Go to the issue