2018 | OriginalPaper | Chapter

Cross-Lingual Information Retrieval: A Dictionary-Based Query Translation Approach

Authors: Vijay Kumar Sharma, Namita Mittal

Published in: Advances in Computer and Computational Sciences

Publisher: Springer Singapore

Abstract

The rapidly growing Internet population and the abundance of multilingual content on the web have increased communication in multiple languages. Most people use their regional languages to express their information needs, and this language diversity becomes a major barrier. Cross-Lingual Information Retrieval (CLIR) addresses this barrier by allowing a user to pose a query in a native language and retrieve relevant documents in a different language. In this paper we build a dictionary-based query translation system. Queries are tokenized, and multi-word query terms are created using an N-gram technique. Out-of-vocabulary (OOV) terms are transliterated using the proposed OOVTTM technique. Target documents are retrieved using a vector space retrieval model. Experimental results show that the proposed approach achieves better results.
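The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the dictionary entries, the sample documents, and all function names are hypothetical, the source terms are romanized for readability, and the `transliterate` argument is only a stand-in, since the abstract does not describe how OOVTTM works. The sketch assumes longest-match dictionary lookup over N-grams and a plain TF-IDF cosine ranking as one common form of the vector space model.

```python
import math
from collections import Counter

# Hypothetical toy dictionary: romanized source terms -> target-language terms.
DICTIONARY = {
    "jal": ["water"],
    "pradushan": ["pollution"],
    "pradhan mantri": ["prime minister"],
}

# Hypothetical target-language collection.
DOCS = [
    "water pollution in rivers",
    "the prime minister gave a speech",
    "cricket match results",
]


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def translate_query(query, dictionary, transliterate, max_n=2):
    """Translate a query by longest-match N-gram dictionary lookup.

    Tokens with no dictionary entry (OOV) are passed to `transliterate`,
    a placeholder for whatever transliteration method is used.
    """
    tokens = tokenize(query)
    translated = []
    i = 0
    while i < len(tokens):
        # Try the longest N-gram starting at position i first.
        for n in range(min(max_n, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in dictionary:
                for target in dictionary[phrase]:
                    translated.extend(target.split())
                i += n
                break
        else:
            # No N-gram matched: treat the single token as OOV.
            translated.append(transliterate(tokens[i]))
            i += 1
    return translated


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (dicts) and the IDF table for a collection."""
    tokenized = [tokenize(d) for d in docs]
    doc_freq = Counter(term for toks in tokenized for term in set(toks))
    idf = {t: math.log(len(docs) / df) + 1.0 for t, df in doc_freq.items()}
    vectors = [
        {t: tf * idf[t] for t, tf in Counter(toks).items()} for toks in tokenized
    ]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query_terms, docs):
    """Rank documents by cosine similarity; returns (score, doc_index) pairs."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms unseen in the collection get weight 0.
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(query_terms).items()}
    return sorted(((cosine(q, v), i) for i, v in enumerate(vectors)), reverse=True)


# Identity function used as a stand-in transliterator (an assumption).
terms = translate_query("jal pradushan", DICTIONARY, lambda tok: tok)
ranking = retrieve(terms, DOCS)
```

Here `ranking` holds the documents ordered by similarity to the translated query. Trying the longest N-gram first keeps a multi-word term such as "pradhan mantri" from being translated word by word.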

Metadata

Title: Cross-Lingual Information Retrieval: A Dictionary-Based Query Translation Approach
Authors: Vijay Kumar Sharma, Namita Mittal
Copyright Year: 2018
Publisher: Springer Singapore
DOI: https://doi.org/10.1007/978-981-10-3773-3_59
