Skip to main content
Top

2017 | OriginalPaper | Chapter

Keyword-Based Search on Bilingual Digital Libraries

Authors : Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović

Published in: Semantic Keyword-Based Search on Structured Data Sources

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Bibliša supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic dictionaries, SQL and NoSQL databases, which are distributed in different servers accessed in various ways. The web application has been tested on a collection of texts from 3 journals and 2 projects, comprising 299 documents generated from TMX, stored in a NoSQL database. The tool allows the full-text and metadata search, with extraction of concordance sentence pairs for translation and terminology work support.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Gavrilidou, M., Labropoulou, P., Desipri, E., Giouli, V., Antonopoulos, V., Piperidis, S.: Building parallel corpora for econtent professionals. In: Proceedings of the Workshop on Multilingual Linguistic Resources, pp. 97–100. Association for Computational Linguistics (2004) Gavrilidou, M., Labropoulou, P., Desipri, E., Giouli, V., Antonopoulos, V., Piperidis, S.: Building parallel corpora for econtent professionals. In: Proceedings of the Workshop on Multilingual Linguistic Resources, pp. 97–100. Association for Computational Linguistics (2004)
2.
go back to reference Graën, J., Clematide, S., Volk, M.: Efficient exploration of translation variants in large multiparallel corpora using a relational database. In: 4th WS on Challenges in the Management of Large Corpora (Workshop Programme), p. 20 (2016) Graën, J., Clematide, S., Volk, M.: Efficient exploration of translation variants in large multiparallel corpora using a relational database. In: 4th WS on Challenges in the Management of Large Corpora (Workshop Programme), p. 20 (2016)
4.
go back to reference Kovačević, L., Injac, V., Begenišić, D.: Bibliotekarski terminološki rečnik: englesko-srpski, srpsko-engleski [Library Terminological Dictionary: English-Serbian, Serbian-English]. Narodna biblioteka Srbije (2004) Kovačević, L., Injac, V., Begenišić, D.: Bibliotekarski terminološki rečnik: englesko-srpski, srpsko-engleski [Library Terminological Dictionary: English-Serbian, Serbian-English]. Narodna biblioteka Srbije (2004)
5.
go back to reference Krstev, C.: Processing of Serbian – Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade (2008) Krstev, C.: Processing of Serbian – Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade (2008)
6.
go back to reference Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V., Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V.: Digital libraries in the knowledge era: knowledge management and semantic web technologies. Libr. Manage. 26(4/5), 170–175 (2005)CrossRef Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V., Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V.: Digital libraries in the knowledge era: knowledge management and semantic web technologies. Libr. Manage. 26(4/5), 170–175 (2005)CrossRef
7.
go back to reference Obradović, I., Stanković, R., Utvić, M.: An integrated environment for development of parallel corpora. In: Die Unterschiede zwischen dem Bosnischen/Bosniakischen, Kroatischen und Serbischen, pp. 563–578 (2008) Obradović, I., Stanković, R., Utvić, M.: An integrated environment for development of parallel corpora. In: Die Unterschiede zwischen dem Bosnischen/Bosniakischen, Kroatischen und Serbischen, pp. 563–578 (2008)
8.
go back to reference Radovanović, M., Ivanović, M.: Text mining: approaches and applications. Novi Sad J. Math. 38(3), 227–234 (2008)MATH Radovanović, M., Ivanović, M.: Text mining: approaches and applications. Novi Sad J. Math. 38(3), 227–234 (2008)MATH
9.
go back to reference Savourel, Y.: TMX 1.4 b Specification, The Localisation Industry Standards Association (LISA) (2004) Savourel, Y.: TMX 1.4 b Specification, The Localisation Industry Standards Association (LISA) (2004)
10.
go back to reference Stanković, R., Krstev, C., Lazić, B., Vorkapić, D.: A bilingual digital library for academic and entrepreneurial knowledge management. In: Spender, J., Schiuma, G., Albino, V. (eds.) 10th International Forum on Knowledge Asset Dynamics – IFKAD 2015, pp. 1764–1777 (2015). http://www.knowledgeasset.org/Proceedings/ Stanković, R., Krstev, C., Lazić, B., Vorkapić, D.: A bilingual digital library for academic and entrepreneurial knowledge management. In: Spender, J., Schiuma, G., Albino, V. (eds.) 10th International Forum on Knowledge Asset Dynamics – IFKAD 2015, pp. 1764–1777 (2015). http://​www.​knowledgeasset.​org/​Proceedings/​
11.
go back to reference Stanković, R., Krstev, C., Obradović, I., Kitanović, O.: Indexing of textual databases based on lexical resources: a case study for Serbian. In: Cardoso, J., Guerra, F., Houben, G.-J., Pinto, A.M., Velegrakis, Y. (eds.) KEYSTONE 2015. LNCS, vol. 9398, pp. 167–181. Springer, Heidelberg (2015). doi:10.1007/978-3-319-27932-9_15 CrossRef Stanković, R., Krstev, C., Obradović, I., Kitanović, O.: Indexing of textual databases based on lexical resources: a case study for Serbian. In: Cardoso, J., Guerra, F., Houben, G.-J., Pinto, A.M., Velegrakis, Y. (eds.) KEYSTONE 2015. LNCS, vol. 9398, pp. 167–181. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-27932-9_​15 CrossRef
12.
go back to reference Stanković, R., Krstev, C., Obradović, I., Trtovac, A., Utvić, M.: A tool for enhanced search of multilingual digital libraries of e-journals. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012) Stanković, R., Krstev, C., Obradović, I., Trtovac, A., Utvić, M.: A tool for enhanced search of multilingual digital libraries of e-journals. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012)
13.
go back to reference Stanković, R., Obradović, I., Krstev, C., Vitas, D.: Production of morphological dictionaries of multi-word units using a multipurpose tool. In: Jassem, K., Fuglewicz, P.W., Piasecki, M., Przepiórkowski, A. (eds.) Proceedings of the Computational Linguistics-Applications Conference, pp. 77–84 (2011). ISBN: 978-83-60810-47-7 Stanković, R., Obradović, I., Krstev, C., Vitas, D.: Production of morphological dictionaries of multi-word units using a multipurpose tool. In: Jassem, K., Fuglewicz, P.W., Piasecki, M., Przepiórkowski, A. (eds.) Proceedings of the Computational Linguistics-Applications Conference, pp. 77–84 (2011). ISBN: 978-83-60810-47-7
14.
go back to reference Stanković, R., Trivić, B., Kitanović, O., Blagojević, B., Nikolić, V.: The Development of the GeolISSTerm Terminological Dictionary. INFOtheca 12(1), 49a–63a (2011) Stanković, R., Trivić, B., Kitanović, O., Blagojević, B., Nikolić, V.: The Development of the GeolISSTerm Terminological Dictionary. INFOtheca 12(1), 49a–63a (2011)
15.
go back to reference Thong, J.Y., Hong, W., Tam, K.Y.: What leads to user acceptance of digital libraries? Commun. ACM 47(11), 78–83 (2004)CrossRef Thong, J.Y., Hong, W., Tam, K.Y.: What leads to user acceptance of digital libraries? Commun. ACM 47(11), 78–83 (2004)CrossRef
16.
go back to reference Tiedemann, J.: Parallel data, tools and interfaces in OPUS. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012) Tiedemann, J.: Parallel data, tools and interfaces in OPUS. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012)
17.
go back to reference Tufis, D., Cristea, D., Stamou, S.: Balkanet: aims, methods, results and perspectives. A general overview. Rom. J. Inf. Sci. Technol. 7(1–2), 9–43 (2004) Tufis, D., Cristea, D., Stamou, S.: Balkanet: aims, methods, results and perspectives. A general overview. Rom. J. Inf. Sci. Technol. 7(1–2), 9–43 (2004)
18.
go back to reference Volk, M., Graën, J., Callegaro, E.: Innovations in parallel corpus search tools. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3172–3178 (2014) Volk, M., Graën, J., Callegaro, E.: Innovations in parallel corpus search tools. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3172–3178 (2014)
Metadata
Title
Keyword-Based Search on Bilingual Digital Libraries
Authors
Ranka Stanković
Cvetana Krstev
Duško Vitas
Nikola Vulović
Olivera Kitanović
Copyright Year
2017
Publisher
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-319-53640-8_10