Skip to main content
Top

2020 | OriginalPaper | Chapter

LSTM-Based Neural Network Model for Semantic Search

Authors : Xiaoyu Guo, Jing Ma, Xiaofeng Li

Published in: Smart Service Systems, Operations Management, and Analytics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

To improve web search quality and serve a better search experience for users, it is important to capture semantic information from user query which contains user’s intention in web search. Long Short-Term Memory (LSTM), a significant network in deep learning has made tremendous achievements in capturing semantic information and predicting the semantic relatedness of two sentences. In this study, considering the similarity between predicting the relatedness of sentence pair task and semantic search, we provide a novel channel to process semantic search task: see semantic search as an atypical predicting the relatedness of sentence pair task. Furthermore, we propose an LSTM-Based Neural Network Model which is suitable for predicting the semantic relatedness between user query and potential documents. The proposed LSTM-Based Neural Network Model is trained by Home Depot dataset. Results show that our model outperforms than other models.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference O. Vinyals, A. Toshev, S. Bengio et al., Show and Tell: A Neural Image Caption Generator [J] (2014) O. Vinyals, A. Toshev, S. Bengio et al., Show and Tell: A Neural Image Caption Generator [J] (2014)
2.
go back to reference I. Sutskever, O. Vinyals, Q.V. Le, Sequence to Sequence Learning with Neural Networks [J] (2014) I. Sutskever, O. Vinyals, Q.V. Le, Sequence to Sequence Learning with Neural Networks [J] (2014)
3.
go back to reference P.S. Huang, X. He, J. Gao et al., Learning deep structured semantic models for web search using clickthrough data [C], in Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management (ACM, 2013) P.S. Huang, X. He, J. Gao et al., Learning deep structured semantic models for web search using clickthrough data [C], in Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management (ACM, 2013)
4.
go back to reference H. Nassif, M. Mohtarami, J. Glass, Learning semantic relatedness in community question answering using neural models [C], in Proceedings of the 1st Workshop on Representation Learning for NLP (2016), pp. 137–147 H. Nassif, M. Mohtarami, J. Glass, Learning semantic relatedness in community question answering using neural models [C], in Proceedings of the 1st Workshop on Representation Learning for NLP (2016), pp. 137–147
5.
go back to reference T. Mikolov, Statistical Language Models Based on Neural Networks [J] (Presentation at Google, Mountain View, 2012), p. 80 T. Mikolov, Statistical Language Models Based on Neural Networks [J] (Presentation at Google, Mountain View, 2012), p. 80
6.
go back to reference R. Pascanu, T Mikolov, Y. Bengio, On the difficulty of training recurrent neural networks [C], in International Conference on Machine Learning (2013), pp. 1310–1318 R. Pascanu, T Mikolov, Y. Bengio, On the difficulty of training recurrent neural networks [C], in International Conference on Machine Learning (2013), pp. 1310–1318
7.
go back to reference S. Hochreiter, J. Schmidhuber, Long short-term memory[J]. Neural Comput. 9(8), 1735–1780 (1997)CrossRef S. Hochreiter, J. Schmidhuber, Long short-term memory[J]. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
8.
go back to reference H. Palangi, L. Deng, Y. Shen et al., Deep sentence embedding using long short-term memory networks: Analysis and application to information retrieval [J]. IEEE/ACM Trans. Audio Speech Lang. Process (TASLP) 24(4), 694–707 (2016)CrossRef H. Palangi, L. Deng, Y. Shen et al., Deep sentence embedding using long short-term memory networks: Analysis and application to information retrieval [J]. IEEE/ACM Trans. Audio Speech Lang. Process (TASLP) 24(4), 694–707 (2016)CrossRef
9.
go back to reference A. Sordoni, Y. Bengio, H. Vahabi et al., A hierarchical recurrent encoder-decoder for generative context-aware query suggestion[C], in Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (ACM, 2015), pp. 553–562 A. Sordoni, Y. Bengio, H. Vahabi et al., A hierarchical recurrent encoder-decoder for generative context-aware query suggestion[C], in Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (ACM, 2015), pp. 553–562
10.
go back to reference K.S. Tai, R. Socher, C.D. Manning, Improved Semantic Representations from Tree-Structured Long Short-Term Memory Networks [J] (2015). arXiv:1503.00075, 2015 K.S. Tai, R. Socher, C.D. Manning, Improved Semantic Representations from Tree-Structured Long Short-Term Memory Networks [J] (2015). arXiv:​1503.​00075, 2015
12.
go back to reference H.S. Joshi, Finding Similar Questions in Large-scale Community QA Forums [D]. Massachusetts Institute of Technology (2016) H.S. Joshi, Finding Similar Questions in Large-scale Community QA Forums [D]. Massachusetts Institute of Technology (2016)
14.
go back to reference J. Pennington, R. Socher, C. Manning, Glove: global vectors for word representation [C], in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014), pp. 1532–1543 J. Pennington, R. Socher, C. Manning, Glove: global vectors for word representation [C], in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014), pp. 1532–1543
15.
go back to reference J. Duchi, E. Hazan, Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization[J]. J. Mach. Learn. Res. 2121–2159 (2011) J. Duchi, E. Hazan, Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization[J]. J. Mach. Learn. Res. 2121–2159 (2011)
16.
go back to reference A.E. Hoerl, R.W. Kennard, Ridge regression: biased estimation for nonorthogonal problems [J]. Technometrics 12(1), 55–67 (1970)CrossRef A.E. Hoerl, R.W. Kennard, Ridge regression: biased estimation for nonorthogonal problems [J]. Technometrics 12(1), 55–67 (1970)CrossRef
Metadata
Title
LSTM-Based Neural Network Model for Semantic Search
Authors
Xiaoyu Guo
Jing Ma
Xiaofeng Li
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-30967-1_3