Published in: Cluster Computing 2/2019

08-03-2018

RETRACTED ARTICLE: Research on Chinese and English language information retrieval algorithm based on bilingual theme model

Author: Beibei Dai


Abstract

To help users query English-language information in their mother tongue, this paper studies the feasibility of cross-language information retrieval with a bilingual theme model. It approaches the problem through precise correspondence search between Chinese and English topic words and uses a bilingual theme space constructed from bilingual related topics. It presents a matching algorithm that uses the weight of each topic to reflect topic correlation and returns results in descending order of the weight statistics. The data analysis indicates that the proposed retrieval algorithm is feasible, accurate, and highly interactive with users. It can basically achieve the retrieval of English information by BITC and lets users access English information through their native language.
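The abstract gives no formulas, so the following is only a minimal sketch of what ranking by topic weight could look like, not the paper's algorithm. It assumes the query and each English document are already represented as weights over a shared bilingual topic space, and it scores a document by summing the products of the weights on shared topics. The names `topic_score` and `rank_documents`, the scoring rule, and the sample weights are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: topic id -> weight in a shared bilingual topic space.
TopicWeights = Dict[int, float]


def topic_score(query: TopicWeights, doc: TopicWeights) -> float:
    """Sum of weight products over the topics shared by query and document.

    This scoring rule is an assumption made for illustration only.
    """
    return sum(weight * doc[topic] for topic, weight in query.items() if topic in doc)


def rank_documents(
    query: TopicWeights, docs: Dict[str, TopicWeights]
) -> List[Tuple[str, float]]:
    """Return (document id, score) pairs sorted by score in descending order."""
    scored = [(doc_id, topic_score(query, weights)) for doc_id, weights in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up weights: a Chinese query mapped to topics 0 and 1,
# and three English documents with their own topic weights.
query_topics = {0: 0.7, 1: 0.3}
english_docs = {
    "en_doc_a": {0: 0.1, 2: 0.9},
    "en_doc_b": {0: 0.6, 1: 0.4},
    "en_doc_c": {1: 0.5, 2: 0.5},
}

ranking = rank_documents(query_topics, english_docs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.2f}")
```

With these sample weights, `en_doc_b` ranks first because it shares both query topics, followed by `en_doc_c` and then `en_doc_a`.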


Metadata
Title: RETRACTED ARTICLE: Research on Chinese and English language information retrieval algorithm based on bilingual theme model
Author: Beibei Dai
Publication date: 08-03-2018
Publisher: Springer US
Published in: Cluster Computing, Special Issue 2/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI: https://doi.org/10.1007/s10586-018-2218-8
