Skip to main content
Erschienen in: Social Network Analysis and Mining 4/2013

01.12.2013 | Original Article

Hypergraph index: an index for context-aware nearest neighbor query on social networks

verfasst von: Yazhe Wang, Baihua Zheng

Erschienen in: Social Network Analysis and Mining | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Social network has been touted as the No. 2 innovation in a recent IEEE Spectrum Special Report on “Top 11 Technologies of the Decade”, and it has cemented its status as a bona fide Internet phenomenon. With more and more people starting using social networks to share ideas, activities, events, and interests with other members within the network, social networks contain a huge amount of content. However, it might not be easy to navigate social networks to find specific information. In this paper, we define a new type of queries, namely context-aware nearest neighbor (CANN) search over social network to retrieve the nearest node to the query node that matches the textual context specified. The textual context of a node is defined as a set of keywords that describe the important aspects of the nodes. CANN considers both the network structure and the textual context of the nodes, and it has a very broad application base. Two existing searching strategies can be applied to support CANN search. The first one performs the search based on the network distance, and the other one conducts the search based on the node context information. Each of these methods operates according to only one factor but ignores the other one. They can be very inefficient for large social networks, where one factor alone normally has a very limited pruning power. In this paper, we design a hypergraph based method to support efficient approximated CANN search via considering the network structure and nodes’ textual contexts simultaneously. Experimental results show that the hypergraph-based method provides approximated results efficiently with low preprocessing and storage costs, and is scalable to large social networks. The approximation quality of our method is demonstrated based on both theoretical proofs and experimental results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
If there is no edge between two nodes, it does not mean these nodes are not reachable but mean that edge is not needed to prove this lemma.
 
2
The additional storage consumption of \({\mathsf{SI}}\) is none.
 
Literatur
Zurück zum Zitat Ang C (2011) Interaction networks and patterns of guild community in massively multiplayer online games. Soc Netw Anal Min 1:341–353CrossRef Ang C (2011) Interaction networks and patterns of guild community in massively multiplayer online games. Soc Netw Anal Min 1:341–353CrossRef
Zurück zum Zitat Bloom BH (1970) Space/time trade-offs in hash coding with allowable errors. Commun ACM (CACM) 13(7):422–426CrossRefMATH Bloom BH (1970) Space/time trade-offs in hash coding with allowable errors. Commun ACM (CACM) 13(7):422–426CrossRefMATH
Zurück zum Zitat Carmel D, Zwerdling N, Guy I, Ofek-Koifman S, Har’el N, Ronen I, Uziel E, Yogev S, Chernov S (2009) Personalized social search based on the user’s social network. In: Proceedings of the 18th ACM conference on information and knowledge management (CIKM ’09), pp 1227–1236 Carmel D, Zwerdling N, Guy I, Ofek-Koifman S, Har’el N, Ronen I, Uziel E, Yogev S, Chernov S (2009) Personalized social search based on the user’s social network. In: Proceedings of the 18th ACM conference on information and knowledge management (CIKM ’09), pp 1227–1236
Zurück zum Zitat Chong WH, Toh WSB, Teow LN (2010) Efficient extraction of high-betweenness vertices. In: Proceedings of the 2010 international conference on advances in social networks analysis and mining (ASONAM ’10), pp 286–290 Chong WH, Toh WSB, Teow LN (2010) Efficient extraction of high-betweenness vertices. In: Proceedings of the 2010 international conference on advances in social networks analysis and mining (ASONAM ’10), pp 286–290
Zurück zum Zitat Cohen E. (1999) Fast algorithms for constructing t-spanners and paths with stretch t. SIAM J Comput 28:210–236MathSciNetCrossRef Cohen E. (1999) Fast algorithms for constructing t-spanners and paths with stretch t. SIAM J Comput 28:210–236MathSciNetCrossRef
Zurück zum Zitat Dalvi BB, Kshirsagar M, Sudarshan S (2008) Keyword search on external memory data graphs. VLDB Endow 1(1):1189–1204 Dalvi BB, Kshirsagar M, Sudarshan S (2008) Keyword search on external memory data graphs. VLDB Endow 1(1):1189–1204
Zurück zum Zitat Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41CrossRef Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41CrossRef
Zurück zum Zitat Goldberg AV (2007) Point-to-point shortest path algorithms with preprocessing. In: Proceedings of the 33rd conference on current trends in theory and practice of computer science (SOFSEM’07), pp 88–102 Goldberg AV (2007) Point-to-point shortest path algorithms with preprocessing. In: Proceedings of the 33rd conference on current trends in theory and practice of computer science (SOFSEM’07), pp 88–102
Zurück zum Zitat Goldberg AV, Harrelson C (2005) Computing the shortest path: a search meets graph theory. In: Proceedings of the sixteenth annual ACM-SIAM symposium on discrete algorithms (SODA’05), pp 156–165 Goldberg AV, Harrelson C (2005) Computing the shortest path: a search meets graph theory. In: Proceedings of the sixteenth annual ACM-SIAM symposium on discrete algorithms (SODA’05), pp 156–165
Zurück zum Zitat Gutman R. (2004) Reach-based routing: a new approach to shortest path algorithms optimized for road networks. In: Proceeding of the sixth workshop on algorithm engineering and experiments (ALENEX’04), pp 100–111 Gutman R. (2004) Reach-based routing: a new approach to shortest path algorithms optimized for road networks. In: Proceeding of the sixth workshop on algorithm engineering and experiments (ALENEX’04), pp 100–111
Zurück zum Zitat He H, Wang H, Yang J, Yu PS (2007) Blinks: ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD international conference on management of data (SIGMOD’07), pp 305–316 He H, Wang H, Yang J, Yu PS (2007) Blinks: ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD international conference on management of data (SIGMOD’07), pp 305–316
Zurück zum Zitat Hu H, Lee DL, Lee VCS (2006) Distance indexing on road networks. In: Proceedings of the 32nd international conference on Very large data bases (VLDB’06), pp 894–905 Hu H, Lee DL, Lee VCS (2006) Distance indexing on road networks. In: Proceedings of the 32nd international conference on Very large data bases (VLDB’06), pp 894–905
Zurück zum Zitat Hulgeri A, Nakhe C (2002) Keyword searching and browsing in databases using banks. In: Proceedings of the 18th international conference on data engineering (ICDE’02), pp 431–443 Hulgeri A, Nakhe C (2002) Keyword searching and browsing in databases using banks. In: Proceedings of the 18th international conference on data engineering (ICDE’02), pp 431–443
Zurück zum Zitat Jing N, Huang YW, Rundensteiner EA (1998) Hierarchical encoded path views for path query processing: an optimal model and its performance evaluation. IEEE Trans Knowl Data Eng (TKDE) 10(3):409–432CrossRef Jing N, Huang YW, Rundensteiner EA (1998) Hierarchical encoded path views for path query processing: an optimal model and its performance evaluation. IEEE Trans Knowl Data Eng (TKDE) 10(3):409–432CrossRef
Zurück zum Zitat Jung S, Pramanik S (2002) An efficient path computation model for hierarchically structured topographical road maps. IEEE Trans Knowl Data Eng (TKDE) 14(5):1029–1046CrossRef Jung S, Pramanik S (2002) An efficient path computation model for hierarchically structured topographical road maps. IEEE Trans Knowl Data Eng (TKDE) 14(5):1029–1046CrossRef
Zurück zum Zitat Kacholia V, Pandit S, Chakrabarti S, Sudarshan S, Desai R, Karambelkar H (2005) Bidirectional expansion for keyword search on graph databases. In: Proceedings of the 31st international conference on very large data bases (VLDB’05), pp 505–516 Kacholia V, Pandit S, Chakrabarti S, Sudarshan S, Desai R, Karambelkar H (2005) Bidirectional expansion for keyword search on graph databases. In: Proceedings of the 31st international conference on very large data bases (VLDB’05), pp 505–516
Zurück zum Zitat Kourtellis N, Alahakoon T, Simha R, Iamnitchi A, Tripathi R (2012) Identifying high betweenness centrality nodes in large social networks. Soc Netw Anal Min 1–16. doi:10.1007/s13278-012-0076-6 Kourtellis N, Alahakoon T, Simha R, Iamnitchi A, Tripathi R (2012) Identifying high betweenness centrality nodes in large social networks. Soc Netw Anal Min 1–16. doi:10.​1007/​s13278-012-0076-6
Zurück zum Zitat Lee D, Leng C (1989) Partitioned signature file: design considerations and performance evaluation. ACM Trans on Inform Syst (TOIS) 7(2):158–180CrossRef Lee D, Leng C (1989) Partitioned signature file: design considerations and performance evaluation. ACM Trans on Inform Syst (TOIS) 7(2):158–180CrossRef
Zurück zum Zitat Lee DL, Kim YM, Patel G (1995) Efficient signature file methods for text retrieval. IEEEIEEE Trans Knowl Data Eng (TKDE) 7(3):423–435CrossRef Lee DL, Kim YM, Patel G (1995) Efficient signature file methods for text retrieval. IEEEIEEE Trans Knowl Data Eng (TKDE) 7(3):423–435CrossRef
Zurück zum Zitat Lee KCK, Lee WC, Zheng B (2009) Fast object search on road networks. In: Proceedings of the 12th international conference on extending database technology (EDBT’09), pp 1018–1029 Lee KCK, Lee WC, Zheng B (2009) Fast object search on road networks. In: Proceedings of the 12th international conference on extending database technology (EDBT’09), pp 1018–1029
Zurück zum Zitat Lee KCK, Lee WC, Zheng B, Tian Y (2012) Road: a new spatial object search framework for road networks. IEEE Trans Knowl Data Eng (TKDE) 24(3):547–560CrossRef Lee KCK, Lee WC, Zheng B, Tian Y (2012) Road: a new spatial object search framework for road networks. IEEE Trans Knowl Data Eng (TKDE) 24(3):547–560CrossRef
Zurück zum Zitat Leng C, Lee D (1992) Optimal weight assignment for signature generation. ACM Trans Database Syst (TODS) 17(2):346–373CrossRef Leng C, Lee D (1992) Optimal weight assignment for signature generation. ACM Trans Database Syst (TODS) 17(2):346–373CrossRef
Zurück zum Zitat Li G, Feng J, Chin Ooi B, Wang J, Zhou L (2011) An effective 3-in-1 keyword search method over heterogeneous data sources. Inform Syst 36:248–266CrossRef Li G, Feng J, Chin Ooi B, Wang J, Zhou L (2011) An effective 3-in-1 keyword search method over heterogeneous data sources. Inform Syst 36:248–266CrossRef
Zurück zum Zitat Maglaras LA, Katsaros D (2012) New measures for characterizing the significance of nodes in wireless ad hoc networks via localized path-based neighborhood analysis. Soc Netw Anal Min 2:97–106 Maglaras LA, Katsaros D (2012) New measures for characterizing the significance of nodes in wireless ad hoc networks via localized path-based neighborhood analysis. Soc Netw Anal Min 2:97–106
Zurück zum Zitat Martn Gonzlez AM, Dalsgaard B, Olesen JM (2010) Centrality measures and the importance of generalist species in pollination networks. Ecol Complexity 7(1):36–43 Martn Gonzlez AM, Dalsgaard B, Olesen JM (2010) Centrality measures and the importance of generalist species in pollination networks. Ecol Complexity 7(1):36–43
Zurück zum Zitat Newman M (2005) A measure of betweenness centrality based on random walks. Soc Netw 27(1):39–54CrossRef Newman M (2005) A measure of betweenness centrality based on random walks. Soc Netw 27(1):39–54CrossRef
Zurück zum Zitat Samet H, Sankaranarayanan J, Alborzi H (2008) Scalable network distance browsing in spatial databases. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data (SIGMOD’08), pp 43–54 Samet H, Sankaranarayanan J, Alborzi H (2008) Scalable network distance browsing in spatial databases. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data (SIGMOD’08), pp 43–54
Zurück zum Zitat Schenkel R, Crecelius T, Kacimi M, Michel S, Neumann T, Parreira JX, Weikum G (2008) Efficient top-k querying over social-tagging networks. In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (SIGIR’08), pp 523–530 Schenkel R, Crecelius T, Kacimi M, Michel S, Neumann T, Parreira JX, Weikum G (2008) Efficient top-k querying over social-tagging networks. In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (SIGIR’08), pp 523–530
Zurück zum Zitat Vieira MV, Fonseca BM, Damazio R, Golgher PB, Reis DC, Ribeiro-Neto B (2007) Efficient search ranking in social networks. In: Proceedings of the sixteenth ACM conference on conference on information and knowledge management (CIKM’07), pp 563–572 Vieira MV, Fonseca BM, Damazio R, Golgher PB, Reis DC, Ribeiro-Neto B (2007) Efficient search ranking in social networks. In: Proceedings of the sixteenth ACM conference on conference on information and knowledge management (CIKM’07), pp 563–572
Zurück zum Zitat Wei F (2010) TEDI: efficient shortest path query answering on graphs. In: Proceedings of the 2010 international conference on management of data (SIGMOD’10), New York, NY, USA, pp 99–110 Wei F (2010) TEDI: efficient shortest path query answering on graphs. In: Proceedings of the 2010 international conference on management of data (SIGMOD’10), New York, NY, USA, pp 99–110
Zurück zum Zitat Xiao Y, Wu W, Pei J, Wang W, He Z (2009) Efficiently indexing shortest paths by exploiting symmetry in graphs. In: Proceedings of the 12th international conference on extending database technology (EDBT’09), pp 493–504 Xiao Y, Wu W, Pei J, Wang W, He Z (2009) Efficiently indexing shortest paths by exploiting symmetry in graphs. In: Proceedings of the 12th international conference on extending database technology (EDBT’09), pp 493–504
Zurück zum Zitat Yin P, Lee WC, Lee KC (2010) On top-k social web search. In: Proceedings of the 19th ACM international conference on information and knowledge management (CIKM ’10), pp 1313–1316 Yin P, Lee WC, Lee KC (2010) On top-k social web search. In: Proceedings of the 19th ACM international conference on information and knowledge management (CIKM ’10), pp 1313–1316
Metadaten
Titel
Hypergraph index: an index for context-aware nearest neighbor query on social networks
verfasst von
Yazhe Wang
Baihua Zheng
Publikationsdatum
01.12.2013
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 4/2013
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-013-0095-y

Weitere Artikel der Ausgabe 4/2013

Social Network Analysis and Mining 4/2013 Zur Ausgabe