Skip to main content

2018 | OriginalPaper | Buchkapitel

Cross-Lingual Information Retrieval: A Dictionary-Based Query Translation Approach

verfasst von : Vijay Kumar Sharma, Namita Mittal

Erschienen in: Advances in Computer and Computational Sciences

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The rapidly increasing demographics of the Internet population and the abundance of multilingual content on the web increased the communication in multiple languages. Most of the people use their regional languages to express their need and the language diversity becomes a great barrier. Cross-Lingual Information Retrieval (CLIR) provides a solution for that language barrier which allows a user to ask a query in the native language and get the relevant documents in the different language. In this paper we build a dictionary-based query translation system. Queries are tokenized and multi-words query terms are created using N-gram technique. Out of vocabulary (OOV) terms are transliterated using the proposed OOVTTM technique. Target documents are retrieved using vector space retrieval model. Experiment results represent that the proposed approach achieves better results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Nagarathinam A. and Saraswathi S., State of art: Cross Lingual Information Retrieval System for Indian Languages, International Journal of computer application, Vol. 35, No. 13 (2011), pp. 15–21. Nagarathinam A. and Saraswathi S., State of art: Cross Lingual Information Retrieval System for Indian Languages, International Journal of computer application, Vol. 35, No. 13 (2011), pp. 15–21.
2.
Zurück zum Zitat Abusalah, M., J. Tait, and M. Oakes, Literature Review of Cross Language Information Retrieval. World Academy of Science, Engineering and Technology 4 (2005), 175– 177. Abusalah, M., J. Tait, and M. Oakes, Literature Review of Cross Language Information Retrieval. World Academy of Science, Engineering and Technology 4 (2005), 175– 177.
3.
Zurück zum Zitat Nasharuddin N. Amelina and Abdullah M. Taufik., Cross-lingual Information Retrieval State-of-the-Art, electronic Journal of Computer Science and Information Technology (EJCSIT), Vol. 2, No. 1 (2010), pp. 1–5. Nasharuddin N. Amelina and Abdullah M. Taufik., Cross-lingual Information Retrieval State-of-the-Art, electronic Journal of Computer Science and Information Technology (EJCSIT), Vol. 2, No. 1 (2010), pp. 1–5.
4.
Zurück zum Zitat Sujatha P. and Dhavachelvan P., A review on the Cross and Multilingual Information Retrieval, International Journal of Web & Semantic Technology (IJWesT) Vol.2, No.4 (2011), pp. 155–124. Sujatha P. and Dhavachelvan P., A review on the Cross and Multilingual Information Retrieval, International Journal of Web & Semantic Technology (IJWesT) Vol.2, No.4 (2011), pp. 155–124.
5.
Zurück zum Zitat Sharma VK, Mittal N., Cross Lingual Information Retrieval (CLIR): Review of Tools, Challenges and Translation Approaches. In Information System Design and Intelligent Application (2016), pp. 699–708. Sharma VK, Mittal N., Cross Lingual Information Retrieval (CLIR): Review of Tools, Challenges and Translation Approaches. In Information System Design and Intelligent Application (2016), pp. 699–708.
6.
Zurück zum Zitat Pingali P. and Varma V., Hindi and Telugu to English cross language information retrieval at CLEF 2006, In working notes of CLEF, LTRC, IIIT Hyderabad (2006). Pingali P. and Varma V., Hindi and Telugu to English cross language information retrieval at CLEF 2006, In working notes of CLEF, LTRC, IIIT Hyderabad (2006).
7.
Zurück zum Zitat Pingali P. and Varma V., IIIT Hyderabad at CLEF 2007- Adhoc Indian Language CLIR Task, In working notes of CLEF, LTRC, IIIT Hyderabad (2007). Pingali P. and Varma V., IIIT Hyderabad at CLEF 2007- Adhoc Indian Language CLIR Task, In working notes of CLEF, LTRC, IIIT Hyderabad (2007).
8.
Zurück zum Zitat Makin R., Pandey N., Pingali P., and Varma V., Approximate String Matching Techniques for Effective CLIR, International Workshop on Fuzzy Logic and Applications, Italy, Springer-Verlag (2007), pp. 430–437. Makin R., Pandey N., Pingali P., and Varma V., Approximate String Matching Techniques for Effective CLIR, International Workshop on Fuzzy Logic and Applications, Italy, Springer-Verlag (2007), pp. 430–437.
9.
Zurück zum Zitat Sethuramalingam S. and Varma V., IIIT Hyderabad’s CLIR Experiments for FIRE-2008, FIRE 2008. Sethuramalingam S. and Varma V., IIIT Hyderabad’s CLIR Experiments for FIRE-2008, FIRE 2008.
10.
Zurück zum Zitat Janarthanam S.C., Sethuramalingam S. and Nallasamy U., Named Entity Transliteration for Cross-Language Information Retrieval using Compressed Word Format Mapping Algorithm, in proceedings of the 2nd ACM workshop on Improving non English web searching, ACM (2008), pp. 33–38. Janarthanam S.C., Sethuramalingam S. and Nallasamy U., Named Entity Transliteration for Cross-Language Information Retrieval using Compressed Word Format Mapping Algorithm, in proceedings of the 2nd ACM workshop on Improving non English web searching, ACM (2008), pp. 33–38.
11.
Zurück zum Zitat Jagarlamudi J. and Kumaran A., Cross-Lingual Information Retrieval System for Indian Languages, Advances in multilingual and multi modal information retrieval (2008), pp. 80–87. Jagarlamudi J. and Kumaran A., Cross-Lingual Information Retrieval System for Indian Languages, Advances in multilingual and multi modal information retrieval (2008), pp. 80–87.
12.
Zurück zum Zitat Pattabhi R. K. Rao and Shobha L., AU-KBC FIRE2010 submission – Cross Lingual Information Retrieval Track: Tamil-English, Fire 2010. Pattabhi R. K. Rao and Shobha L., AU-KBC FIRE2010 submission – Cross Lingual Information Retrieval Track: Tamil-English, Fire 2010.
13.
Zurück zum Zitat Bajpai P. and Verma P., Cross Language Information Retrieval: In Indian Language Perspective, International Journal of Research in Engineering and Technology, Vol. 3 (2014), pp. 46–52. Bajpai P. and Verma P., Cross Language Information Retrieval: In Indian Language Perspective, International Journal of Research in Engineering and Technology, Vol. 3 (2014), pp. 46–52.
Metadaten
Titel
Cross-Lingual Information Retrieval: A Dictionary-Based Query Translation Approach
verfasst von
Vijay Kumar Sharma
Namita Mittal
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-3773-3_59

Premium Partner