
2016 | OriginalPaper | Chapter

Subtopic Mining Based on Three-Level Hierarchical Search Intentions

Authors: Se-Jong Kim, Jaehun Shin, Jong-Hyeok Lee

Published in: Advances in Information Retrieval

Publisher: Springer International Publishing


Abstract

This paper proposes a subtopic mining method based on three-level hierarchical search intentions. Various subtopic candidates are extracted from web documents using a simple pattern, and higher-level and lower-level subtopics are selected from these candidates. The selected subtopics, treated as second-level subtopics, are ranked by a proposed measure and are then expanded and re-ranked according to the characteristics of the resources. Using general terms in the higher-level subtopics, we group the second-level subtopics and generate first-level subtopics. Our method achieved better performance than a state-of-the-art method.
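The pipeline outlined in the abstract (extract candidates with a pattern, rank them, then group them by shared general terms) can be illustrated with a rough sketch. The abstract does not state the actual pattern, the proposed ranking measure, or the grouping rule, so everything below is an assumption made for illustration: the pattern "query followed by one or two words", plain frequency as a stand-in for the ranking measure, the threshold `min_members`, and all function names are hypothetical and are not the authors' method.

```python
import re
from collections import Counter, defaultdict


def extract_candidates(query, documents):
    """Count phrases of the form '<query> word [word]' (assumed pattern)."""
    pattern = re.compile(
        r"\b" + re.escape(query) + r"(?:\s+\w+){1,2}", re.IGNORECASE
    )
    counts = Counter()
    for doc in documents:
        for match in pattern.findall(doc):
            # Normalize case and whitespace so equal phrases are merged.
            counts[" ".join(match.lower().split())] += 1
    return counts


def rank_candidates(counts):
    """Order candidates by frequency (a stand-in for the paper's measure)."""
    return sorted(counts, key=lambda c: (-counts[c], c))


def group_by_general_terms(query, ranked, min_members=2):
    """Group second-level candidates under terms shared by several of them.

    Each shared term yields a hypothetical first-level subtopic
    '<query> <term>' whose members are the candidates containing it.
    """
    query_words = set(query.lower().split())
    members_by_term = defaultdict(list)
    for candidate in ranked:
        for term in set(candidate.split()) - query_words:
            members_by_term[term].append(candidate)
    return {
        f"{query.lower()} {term}": members
        for term, members in members_by_term.items()
        if len(members) >= min_members
    }
```

For instance, given snippets mentioning "apple store hours", "apple store locations", "apple pie recipe", and "apple pie crust", this sketch would place the first two under a first-level subtopic "apple store" and the last two under "apple pie". A real system would need a far more selective notion of "general term" than simple co-occurrence.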


Footnotes
1
Query dimensions are groups of items extracted from list-style structures, such as tables, in the top retrieved documents [6]. Each dimension has a ranked list of its items.
 
Literature
1.
Song, R., Zhang, M., Sakai, T., Kato, M.P., Liu, Y., Sugimoto, M., Wang, Q., Orii, N.: Overview of the NTCIR-9 INTENT task. In: Proceedings of NTCIR-9 Workshop Meeting, pp. 82–105. National Institute of Informatics, Tokyo, Japan (2011)
2.
Sakai, T., Dou, Z., Yamamoto, T., Liu, Y., Zhang, M., Song, R.: Overview of the NTCIR-10 INTENT-2 task. In: Proceedings of NTCIR-10 Workshop Meeting, pp. 94–123. National Institute of Informatics, Tokyo, Japan (2013)
3.
Liu, Y., Song, R., Zhang, M., Dou, Z., Yamamoto, T., Kato, M., Ohshima, H., Zhou, K.: Overview of the NTCIR-11 IMine task. In: Proceedings of NTCIR-11 Workshop Meeting, pp. 8–23. National Institute of Informatics, Tokyo, Japan (2014)
4.
Yamamoto, T., Kato, M.P., Ohshima, H., Tanaka, K.: KUIDL at the NTCIR-11 IMine task. In: Proceedings of NTCIR-11 Workshop Meeting, pp. 53–54. National Institute of Informatics, Tokyo, Japan (2014)
5.
Luo, C., Li, X., Khodzhaev, A., Chen, F., Xu, K., Cao, Y., Liu, Y., Zhang, M., Ma, S.: THUSAM at NTCIR-11 IMine task. In: Proceedings of NTCIR-11 Workshop Meeting, pp. 55–62. National Institute of Informatics, Tokyo, Japan (2014)
6.
Dou, Z., Hu, S., Luo, Y., Song, R., Wen, J.R.: Finding dimensions for queries. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1311–1320. Association for Computing Machinery, Glasgow, Scotland, UK (2011)
7.
Zeng, H.J., He, Q.C., Chen, Z., Ma, W.Y., Ma, J.: Learning to cluster web search results. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 210–217. Association for Computing Machinery, Sheffield, South Yorkshire, UK (2004)
8.
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
9.
Robertson, S., Zaragoza, H.: The probabilistic relevance framework: BM25 and beyond. Found. Trends Inf. Retr. 3(4), 333–389 (2009)
Metadata
Title
Subtopic Mining Based on Three-Level Hierarchical Search Intentions
Authors
Se-Jong Kim
Jaehun Shin
Jong-Hyeok Lee
Copyright Year
2016
Publisher
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-319-30671-1_62