Skip to main content

2017 | OriginalPaper | Buchkapitel

Towards a Query-Less News Search Framework on Twitter

verfasst von : Xiaotian Hao, Ji Cheng, Jan Vosecky, Wilfred Ng

Erschienen in: Database Systems for Advanced Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Twitter enables users to browse and access the latest news-related content. However, given user’s interest in a particular news-related tweet, searching for related content may be a tedious process. Formulating an effective search query is not a trivial task. And due to the often small size of smart phone screens, instead of typing, users always prefer click-based operations to retrieve related content. To address these issues, we introduce a new paradigm for news-related Twitter search called Search by Tweet(SbT). In this paradigm, a user submits a particular tweet which triggers a search task to retrieve further related tweets. In this paper, we formalize the SbT problem and propose an effective and efficient framework implementing such a functionality. At the core, we model the public Twitter stream as a dynamic graph-of-words, reflecting the importance of both words and word correlations. Given an input tweet, our framework utilizes the graph model to generate an implicit query. Our techniques demonstrate high efficiency and effectiveness as evaluated using a large-scale Twitter dataset and a user study.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
As pre-processing steps, we remove reply-tweets, user names and stopwords. All hashtags are retained and no stemming is applied.
 
3
Features: burst_sum, graphAvgCorr_min, clustDeg_min, clustRelDeg_sum, clustSumCorr_max.
 
4
Thus, 96.4% of tweets have at least one ‘relevant’ query. Among these tweets, on average 4.1 out of 12.5 queries are ‘relevant’.
 
Literatur
2.
Zurück zum Zitat Agarwal, M.K., Ramamritham, K., Bhide, M.: Real time discovery of dense clusters in highly dynamic graphs: identifying real world events in highly dynamic environments. In: VLDB (2012) Agarwal, M.K., Ramamritham, K., Bhide, M.: Real time discovery of dense clusters in highly dynamic graphs: identifying real world events in highly dynamic environments. In: VLDB (2012)
3.
Zurück zum Zitat Aggarwal, C.C., Zhao, Y., Yu, P.S.: On clustering graph streams. In: SIAM, pp. 478–489 (2010) Aggarwal, C.C., Zhao, Y., Yu, P.S.: On clustering graph streams. In: SIAM, pp. 478–489 (2010)
4.
Zurück zum Zitat Angel, A., Sarkas, N., Koudas, N., Srivastava, D.: Dense subgraph maintenance under streaming edge weight updates for real-time story identification. In: VLDB (2012) Angel, A., Sarkas, N., Koudas, N., Srivastava, D.: Dense subgraph maintenance under streaming edge weight updates for real-time story identification. In: VLDB (2012)
5.
Zurück zum Zitat Dalton, J., Dietz, L., Allan, J.: Entity query feature expansion using knowledge base links. In: SIGIR (2014) Dalton, J., Dietz, L., Allan, J.: Entity query feature expansion using knowledge base links. In: SIGIR (2014)
6.
Zurück zum Zitat Efron, M., Golovchinsky, G.: Estimation methods for ranking recent information. In: SIGIR (2011) Efron, M., Golovchinsky, G.: Estimation methods for ranking recent information. In: SIGIR (2011)
7.
Zurück zum Zitat Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: ACL (2005) Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: ACL (2005)
8.
Zurück zum Zitat Gao, J., Xu, G., Xu, J.: Query expansion using path-constrained random walks. In: SIGIR (2013) Gao, J., Xu, G., Xu, J.: Query expansion using path-constrained random walks. In: SIGIR (2013)
9.
Zurück zum Zitat He, Q., Chang, K., Lim, E.P.: Using burstiness to improve clustering of topics in news streams. In: ICDM (2007) He, Q., Chang, K., Lim, E.P.: Using burstiness to improve clustering of topics in news streams. In: ICDM (2007)
10.
Zurück zum Zitat Kim, Y., Croft, W.B.: Diversifying query suggestions based on query documents. In: SIGIR (2014) Kim, Y., Croft, W.B.: Diversifying query suggestions based on query documents. In: SIGIR (2014)
11.
Zurück zum Zitat Kleinberg, J.: Bursty and hierarchical structure in streams. In: KDD (2002) Kleinberg, J.: Bursty and hierarchical structure in streams. In: KDD (2002)
12.
Zurück zum Zitat Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97(1–2), 273–324 (1997)CrossRefMATH Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97(1–2), 273–324 (1997)CrossRefMATH
13.
Zurück zum Zitat Lappas, T., Arai, B., Platakis, M., Kotsakos, D., Gunopulos, D.: On burstiness-aware search for document sequences. In: KDD, p. 477 (2009) Lappas, T., Arai, B., Platakis, M., Kotsakos, D., Gunopulos, D.: On burstiness-aware search for document sequences. In: KDD, p. 477 (2009)
14.
Zurück zum Zitat Lee, P., Lakshmanan, L.V., Milios, E.E.: Incremental cluster evolution tracking from highly dynamic network data. In: ICDE (2014) Lee, P., Lakshmanan, L.V., Milios, E.E.: Incremental cluster evolution tracking from highly dynamic network data. In: ICDE (2014)
15.
Zurück zum Zitat Massoudi, K., Tsagkias, M., Rijke, M., Weerkamp, W.: Incorporating query expansion and quality indicators in searching microblog posts. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 362–367. Springer, Heidelberg (2011). doi:10.1007/978-3-642-20161-5_36 CrossRef Massoudi, K., Tsagkias, M., Rijke, M., Weerkamp, W.: Incorporating query expansion and quality indicators in searching microblog posts. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 362–367. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-20161-5_​36 CrossRef
16.
Zurück zum Zitat Miyanishi, T., Seki, K., Uehara, K.: Improving pseudo-relevance feedback via tweet selection. In: CIKM (2013) Miyanishi, T., Seki, K., Uehara, K.: Improving pseudo-relevance feedback via tweet selection. In: CIKM (2013)
17.
Zurück zum Zitat Teevan, J., Ramage, D., Morris, M.R.: #TwitterSearch: a comparison of microblog search and web search. In: WSDM (2011) Teevan, J., Ramage, D., Morris, M.R.: #TwitterSearch: a comparison of microblog search and web search. In: WSDM (2011)
18.
Zurück zum Zitat Wang, C., Zhang, M., Ru, L., Ma, S.: Automatic online news topic ranking using media focus and user attention based on aging theory. In: CIKM, October 2008 Wang, C., Zhang, M., Ru, L., Ma, S.: Automatic online news topic ranking using media focus and user attention based on aging theory. In: CIKM, October 2008
19.
Zurück zum Zitat Yuan, M., Wu, K.-L., Jacques-Silva, G., Lu, Y.: Efficient processing of streaming graphs for evolution-aware clustering categories and subject descriptors. In: CIKM (2013) Yuan, M., Wu, K.-L., Jacques-Silva, G., Lu, Y.: Efficient processing of streaming graphs for evolution-aware clustering categories and subject descriptors. In: CIKM (2013)
20.
Zurück zum Zitat Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). doi:10.1007/978-3-642-20161-5_34 CrossRef Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-20161-5_​34 CrossRef
Metadaten
Titel
Towards a Query-Less News Search Framework on Twitter
verfasst von
Xiaotian Hao
Ji Cheng
Jan Vosecky
Wilfred Ng
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-55699-4_9