Skip to main content

2019 | OriginalPaper | Buchkapitel

Named Entity Recognition in Local Intent Web Search Queries

verfasst von : Saloni Mittal, Manoj K. Agarwal

Erschienen in: Database and Expert Systems Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Semantic understanding of web queries is a challenging problem as web queries are short, noisy and usually do not observe the grammar of a written language. In this paper, we specifically study the user web search queries with local intent on Bing. Local intent queries deal with searching for local businesses and services in a location. Hence, local query parsing translates into the classical problem of Named Entity Recognition (NER) in NLP. State-of-the-art NER systems rely heavily on hand-crafted features and domain-specific knowledge to effectively learn from the small, supervised training corpora that is available. In this paper, we use deep learnt neural model that relies solely on features extracted from word embeddings learnt in an unsupervised way, using search logs. We propose a novel technique for generating domain specific embeddings and show that they significantly improve the performance of existing models for the NER task. Our model outperforms the existing CRF based parser currently used in production.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceeding of NAACL-HLT (2016) Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceeding of NAACL-HLT (2016)
3.
Zurück zum Zitat Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of European Chapter of the Association for Computational Linguistics (EACL) (2017) Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of European Chapter of the Association for Computational Linguistics (EACL) (2017)
4.
Zurück zum Zitat Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceeding of Empirical Methods in Natural Language Processing (EMNLP) (2014) Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceeding of Empirical Methods in Natural Language Processing (EMNLP) (2014)
6.
Zurück zum Zitat Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. (JMLR) 12, 2493–2537 (2011)MATH Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. (JMLR) 12, 2493–2537 (2011)MATH
7.
Zurück zum Zitat Cowan, B., Zethelius, S., Luk, B., Baras, T., Ukarde, P., Zhang, D.: Named entity recognition in travel-related search queries. In: Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence (2015) Cowan, B., Zethelius, S., Luk, B., Baras, T., Ukarde, P., Zhang, D.: Named entity recognition in travel-related search queries. In: Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence (2015)
8.
Zurück zum Zitat Blei, D.M., Ng, A.Y., Jordan, M.I., Lafferty, J.: Latent dirichlet allocation. J. Mach. Learn. Res. (JMLR) 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I., Lafferty, J.: Latent dirichlet allocation. J. Mach. Learn. Res. (JMLR) 3, 993–1022 (2003)MATH
9.
Zurück zum Zitat Lafferty, J., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (2001) Lafferty, J., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (2001)
10.
Zurück zum Zitat Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
11.
Zurück zum Zitat Eiselt, A., Figueroa. A.: A two-step named entity recognizer for open-domain search queries. In: Proceedings of the Fourth International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 829–833 (2013) Eiselt, A., Figueroa. A.: A two-step named entity recognizer for open-domain search queries. In: Proceedings of the Fourth International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 829–833 (2013)
12.
13.
Zurück zum Zitat Pasca, M.: Weakly-supervised discovery of named entities using web search queries. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management - CIKM 2007, pp. 683, New York (2007) Pasca, M.: Weakly-supervised discovery of named entities using web search queries. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management - CIKM 2007, pp. 683, New York (2007)
14.
Zurück zum Zitat Du, J., Zhang, Z., Yan, J., Cui, Y., Chen, Z.: Using search session context for named entity recognition in query. In: Proceeding of the 33rd SIGIR (2010) Du, J., Zhang, Z., Yan, J., Cui, Y., Chen, Z.: Using search session context for named entity recognition in query. In: Proceeding of the 33rd SIGIR (2010)
15.
Zurück zum Zitat Guo, J., Xu, G., Cheng, X., Li, H.: 2009 named entity recognition in query. In: Proceedings of the 32nd SIGIR (2009) Guo, J., Xu, G., Cheng, X., Li, H.: 2009 named entity recognition in query. In: Proceedings of the 32nd SIGIR (2009)
17.
Zurück zum Zitat Firth, J.R.: A synopsis of linguistic theory 1930–1955. In: Studies in Linguistic Analysis, pp. 1–32. Oxford Philological Society, Selected Papers of 1952–1959, London (1968) Firth, J.R.: A synopsis of linguistic theory 1930–1955. In: Studies in Linguistic Analysis, pp. 1–32. Oxford Philological Society, Selected Papers of 1952–1959, London (1968)
18.
Zurück zum Zitat Glorot, X., Bengio, Y.: Understanding the difficulty in training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), Sardinia, Italy (2010) Glorot, X., Bengio, Y.: Understanding the difficulty in training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), Sardinia, Italy (2010)
Metadaten
Titel
Named Entity Recognition in Local Intent Web Search Queries
verfasst von
Saloni Mittal
Manoj K. Agarwal
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-27615-7_31

Premium Partner