Skip to main content

2017 | OriginalPaper | Buchkapitel

An Active Learning Approach to Recognizing Domain-Specific Queries From Query Log

verfasst von : Weijian Ni, Tong Liu, Haohao Sun, Zhensheng Wei

Erschienen in: Web and Big Data

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we address the problem of recognizing domain-specific queries from general search engine’s query log. Unlike most previous work in query classification relying on external resources or annotated training queries, we take query log as the only resource for recognizing domain-specific queries. In the proposed approach, we represent query log as a heterogeneous graph and then formulate the task of domain-specific query recognition as graph-based transductive learning. In order to reduce the impact of noisy and insufficient of initial annotated queries, we further introduce an active learning strategy into the learning process such that the manual annotations needed are reduced and the recognition results can be continuously refined through interactive human supervision. Experimental results demonstrate that the proposed approach is capable of recognizing a certain amount of high-quality domain-specific queries with only a small number of manually annotated queries.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Arguello, J., Diaz, F., Callan, J., Crespo, J.F.: Sources of evidence for vertical selection. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 315–322 (2009) Arguello, J., Diaz, F., Callan, J., Crespo, J.F.: Sources of evidence for vertical selection. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 315–322 (2009)
2.
Zurück zum Zitat Giachanou, A., Salampasis, M., Paltoglou, G.: Multilayer source selection as a tool for supporting patent search and classification. Inf. Retrieval J. 18(6), 559–585 (2015)CrossRef Giachanou, A., Salampasis, M., Paltoglou, G.: Multilayer source selection as a tool for supporting patent search and classification. Inf. Retrieval J. 18(6), 559–585 (2015)CrossRef
3.
Zurück zum Zitat Yan, X., Liu, Y., Fand, Q., Zhang, M., Ma, S., Ru, L.: Domain-specific terms extraction based on web resource and user behavior. J. Softw. (in Chinese) 24(9), 2089–2100 (2013) Yan, X., Liu, Y., Fand, Q., Zhang, M., Ma, S., Ru, L.: Domain-specific terms extraction based on web resource and user behavior. J. Softw. (in Chinese) 24(9), 2089–2100 (2013)
4.
Zurück zum Zitat Shen, D., Pan, R., Sun, J.-T., Pan, J.J., Wu, K., Yin, J., Yang, Q.: Query enrichment for web-query classification. ACM Trans. Inf. Syst. 24, 320–352 (2006)CrossRef Shen, D., Pan, R., Sun, J.-T., Pan, J.J., Wu, K., Yin, J., Yang, Q.: Query enrichment for web-query classification. ACM Trans. Inf. Syst. 24, 320–352 (2006)CrossRef
5.
Zurück zum Zitat Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in Web search. In: Proceedings of the 14th International Conference on World Wide Web, pp. 391–400 (2005) Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in Web search. In: Proceedings of the 14th International Conference on World Wide Web, pp. 391–400 (2005)
6.
Zurück zum Zitat Li, X., Wang, Y., Acero, A.: Learning query intent from regularized click graphs. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 339–346 (2008) Li, X., Wang, Y., Acero, A.: Learning query intent from regularized click graphs. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 339–346 (2008)
7.
Zurück zum Zitat Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. Adv. NIPS 16(16), 321–328 (2004) Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. Adv. NIPS 16(16), 321–328 (2004)
8.
Zurück zum Zitat Zhu, X., Lafferty, J., Ghahramani, Z.: Combining active learning and semi-supervised learning using gaussian fields and harmonic functions. In: ICML 2003 Workshop on the Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining (2003) Zhu, X., Lafferty, J., Ghahramani, Z.: Combining active learning and semi-supervised learning using gaussian fields and harmonic functions. In: ICML 2003 Workshop on the Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining (2003)
9.
Zurück zum Zitat Gu, Q., Zhang, T., Han, J.: Batch-mode active learning via error bound minimization. In: Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence, pp. 300–309 (2014) Gu, Q., Zhang, T., Han, J.: Batch-mode active learning via error bound minimization. In: Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence, pp. 300–309 (2014)
10.
Zurück zum Zitat Shi, L., Zhao, Y., Tang, J.: Batch mode active learning for networked data. ACM Trans. Intell. Syst. Technol. 3(2), 1–25 (2012)CrossRef Shi, L., Zhao, Y., Tang, J.: Batch mode active learning for networked data. ACM Trans. Intell. Syst. Technol. 3(2), 1–25 (2012)CrossRef
11.
Zurück zum Zitat Ji, M., Han, J.: A variance minimization criterion to active learning on graphs. In: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics, pp. 556–564 (2012) Ji, M., Han, J.: A variance minimization criterion to active learning on graphs. In: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics, pp. 556–564 (2012)
12.
Zurück zum Zitat Fuxman, A., Tsaparas, P., Achan, K., Agrawal, R.: Using the wisdom of the crowds for keyword generation. In: Proceeding of the 17th International World Wide Web Conference, pp. 61–70 (2008) Fuxman, A., Tsaparas, P., Achan, K., Agrawal, R.: Using the wisdom of the crowds for keyword generation. In: Proceeding of the 17th International World Wide Web Conference, pp. 61–70 (2008)
13.
Zurück zum Zitat Jiang, D., Leung, K.W.T., Ng, W.: Query intent mining with multiple dimensions of web search data. World Wide Web 19(3), 475–497 (2016)CrossRef Jiang, D., Leung, K.W.T., Ng, W.: Query intent mining with multiple dimensions of web search data. World Wide Web 19(3), 475–497 (2016)CrossRef
14.
Zurück zum Zitat Hu, Y., Qian, Y., Li, H., Jiang, D., Pei, J., Zheng, Q.: Mining query subtopics from search log data. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 305–314 (2012) Hu, Y., Qian, Y., Li, H., Jiang, D., Pei, J., Zheng, Q.: Mining query subtopics from search log data. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 305–314 (2012)
15.
Zurück zum Zitat Ji, M., Yan, J., Gu, S., Han, J., He, X., Zhang, W.V., Chen, Z.: Learning search tasks in queries and web pages via graph regularization. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 55–64 (2011) Ji, M., Yan, J., Gu, S., Han, J., He, X., Zhang, W.V., Chen, Z.: Learning search tasks in queries and web pages via graph regularization. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 55–64 (2011)
16.
Zurück zum Zitat Li, Y., Hsu, B.J.P., Zhai, C.: Unsupervised identification of synonymous query intent templates for attribute intents. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 2029–2038 (2013) Li, Y., Hsu, B.J.P., Zhai, C.: Unsupervised identification of synonymous query intent templates for attribute intents. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 2029–2038 (2013)
17.
Zurück zum Zitat Qian, Y., Sakai, T., Ye, J., Zheng, Q., Li, C.: Dynamic query intent mining from a search log stream. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 1205–1208 (2013) Qian, Y., Sakai, T., Ye, J., Zheng, Q., Li, C.: Dynamic query intent mining from a search log stream. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 1205–1208 (2013)
18.
Zurück zum Zitat Ren, X., Wang, Y., Yu, X., Yan, J., Chen, Z., Han, J.: Heterogeneous graph-based intent learning with queries, web pages and Wikipedia concepts. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 23–32 (2014) Ren, X., Wang, Y., Yu, X., Yan, J., Chen, Z., Han, J.: Heterogeneous graph-based intent learning with queries, web pages and Wikipedia concepts. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 23–32 (2014)
Metadaten
Titel
An Active Learning Approach to Recognizing Domain-Specific Queries From Query Log
verfasst von
Weijian Ni
Tong Liu
Haohao Sun
Zhensheng Wei
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-63564-4_2