Skip to main content

2017 | OriginalPaper | Buchkapitel

Keyword-Based Search on Bilingual Digital Libraries

verfasst von : Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović

Erschienen in: Semantic Keyword-Based Search on Structured Data Sources

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Bibliša supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic dictionaries, SQL and NoSQL databases, which are distributed in different servers accessed in various ways. The web application has been tested on a collection of texts from 3 journals and 2 projects, comprising 299 documents generated from TMX, stored in a NoSQL database. The tool allows the full-text and metadata search, with extraction of concordance sentence pairs for translation and terminology work support.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Gavrilidou, M., Labropoulou, P., Desipri, E., Giouli, V., Antonopoulos, V., Piperidis, S.: Building parallel corpora for econtent professionals. In: Proceedings of the Workshop on Multilingual Linguistic Resources, pp. 97–100. Association for Computational Linguistics (2004) Gavrilidou, M., Labropoulou, P., Desipri, E., Giouli, V., Antonopoulos, V., Piperidis, S.: Building parallel corpora for econtent professionals. In: Proceedings of the Workshop on Multilingual Linguistic Resources, pp. 97–100. Association for Computational Linguistics (2004)
2.
Zurück zum Zitat Graën, J., Clematide, S., Volk, M.: Efficient exploration of translation variants in large multiparallel corpora using a relational database. In: 4th WS on Challenges in the Management of Large Corpora (Workshop Programme), p. 20 (2016) Graën, J., Clematide, S., Volk, M.: Efficient exploration of translation variants in large multiparallel corpora using a relational database. In: 4th WS on Challenges in the Management of Large Corpora (Workshop Programme), p. 20 (2016)
4.
Zurück zum Zitat Kovačević, L., Injac, V., Begenišić, D.: Bibliotekarski terminološki rečnik: englesko-srpski, srpsko-engleski [Library Terminological Dictionary: English-Serbian, Serbian-English]. Narodna biblioteka Srbije (2004) Kovačević, L., Injac, V., Begenišić, D.: Bibliotekarski terminološki rečnik: englesko-srpski, srpsko-engleski [Library Terminological Dictionary: English-Serbian, Serbian-English]. Narodna biblioteka Srbije (2004)
5.
Zurück zum Zitat Krstev, C.: Processing of Serbian – Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade (2008) Krstev, C.: Processing of Serbian – Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade (2008)
6.
Zurück zum Zitat Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V., Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V.: Digital libraries in the knowledge era: knowledge management and semantic web technologies. Libr. Manage. 26(4/5), 170–175 (2005)CrossRef Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V., Lytras, M., Sicilia, M.A., Davies, J., Kashyap, V.: Digital libraries in the knowledge era: knowledge management and semantic web technologies. Libr. Manage. 26(4/5), 170–175 (2005)CrossRef
7.
Zurück zum Zitat Obradović, I., Stanković, R., Utvić, M.: An integrated environment for development of parallel corpora. In: Die Unterschiede zwischen dem Bosnischen/Bosniakischen, Kroatischen und Serbischen, pp. 563–578 (2008) Obradović, I., Stanković, R., Utvić, M.: An integrated environment for development of parallel corpora. In: Die Unterschiede zwischen dem Bosnischen/Bosniakischen, Kroatischen und Serbischen, pp. 563–578 (2008)
8.
Zurück zum Zitat Radovanović, M., Ivanović, M.: Text mining: approaches and applications. Novi Sad J. Math. 38(3), 227–234 (2008)MATH Radovanović, M., Ivanović, M.: Text mining: approaches and applications. Novi Sad J. Math. 38(3), 227–234 (2008)MATH
9.
Zurück zum Zitat Savourel, Y.: TMX 1.4 b Specification, The Localisation Industry Standards Association (LISA) (2004) Savourel, Y.: TMX 1.4 b Specification, The Localisation Industry Standards Association (LISA) (2004)
10.
Zurück zum Zitat Stanković, R., Krstev, C., Lazić, B., Vorkapić, D.: A bilingual digital library for academic and entrepreneurial knowledge management. In: Spender, J., Schiuma, G., Albino, V. (eds.) 10th International Forum on Knowledge Asset Dynamics – IFKAD 2015, pp. 1764–1777 (2015). http://www.knowledgeasset.org/Proceedings/ Stanković, R., Krstev, C., Lazić, B., Vorkapić, D.: A bilingual digital library for academic and entrepreneurial knowledge management. In: Spender, J., Schiuma, G., Albino, V. (eds.) 10th International Forum on Knowledge Asset Dynamics – IFKAD 2015, pp. 1764–1777 (2015). http://​www.​knowledgeasset.​org/​Proceedings/​
11.
Zurück zum Zitat Stanković, R., Krstev, C., Obradović, I., Kitanović, O.: Indexing of textual databases based on lexical resources: a case study for Serbian. In: Cardoso, J., Guerra, F., Houben, G.-J., Pinto, A.M., Velegrakis, Y. (eds.) KEYSTONE 2015. LNCS, vol. 9398, pp. 167–181. Springer, Heidelberg (2015). doi:10.1007/978-3-319-27932-9_15 CrossRef Stanković, R., Krstev, C., Obradović, I., Kitanović, O.: Indexing of textual databases based on lexical resources: a case study for Serbian. In: Cardoso, J., Guerra, F., Houben, G.-J., Pinto, A.M., Velegrakis, Y. (eds.) KEYSTONE 2015. LNCS, vol. 9398, pp. 167–181. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-27932-9_​15 CrossRef
12.
Zurück zum Zitat Stanković, R., Krstev, C., Obradović, I., Trtovac, A., Utvić, M.: A tool for enhanced search of multilingual digital libraries of e-journals. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012) Stanković, R., Krstev, C., Obradović, I., Trtovac, A., Utvić, M.: A tool for enhanced search of multilingual digital libraries of e-journals. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012)
13.
Zurück zum Zitat Stanković, R., Obradović, I., Krstev, C., Vitas, D.: Production of morphological dictionaries of multi-word units using a multipurpose tool. In: Jassem, K., Fuglewicz, P.W., Piasecki, M., Przepiórkowski, A. (eds.) Proceedings of the Computational Linguistics-Applications Conference, pp. 77–84 (2011). ISBN: 978-83-60810-47-7 Stanković, R., Obradović, I., Krstev, C., Vitas, D.: Production of morphological dictionaries of multi-word units using a multipurpose tool. In: Jassem, K., Fuglewicz, P.W., Piasecki, M., Przepiórkowski, A. (eds.) Proceedings of the Computational Linguistics-Applications Conference, pp. 77–84 (2011). ISBN: 978-83-60810-47-7
14.
Zurück zum Zitat Stanković, R., Trivić, B., Kitanović, O., Blagojević, B., Nikolić, V.: The Development of the GeolISSTerm Terminological Dictionary. INFOtheca 12(1), 49a–63a (2011) Stanković, R., Trivić, B., Kitanović, O., Blagojević, B., Nikolić, V.: The Development of the GeolISSTerm Terminological Dictionary. INFOtheca 12(1), 49a–63a (2011)
15.
Zurück zum Zitat Thong, J.Y., Hong, W., Tam, K.Y.: What leads to user acceptance of digital libraries? Commun. ACM 47(11), 78–83 (2004)CrossRef Thong, J.Y., Hong, W., Tam, K.Y.: What leads to user acceptance of digital libraries? Commun. ACM 47(11), 78–83 (2004)CrossRef
16.
Zurück zum Zitat Tiedemann, J.: Parallel data, tools and interfaces in OPUS. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012) Tiedemann, J.: Parallel data, tools and interfaces in OPUS. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012)
17.
Zurück zum Zitat Tufis, D., Cristea, D., Stamou, S.: Balkanet: aims, methods, results and perspectives. A general overview. Rom. J. Inf. Sci. Technol. 7(1–2), 9–43 (2004) Tufis, D., Cristea, D., Stamou, S.: Balkanet: aims, methods, results and perspectives. A general overview. Rom. J. Inf. Sci. Technol. 7(1–2), 9–43 (2004)
18.
Zurück zum Zitat Volk, M., Graën, J., Callegaro, E.: Innovations in parallel corpus search tools. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3172–3178 (2014) Volk, M., Graën, J., Callegaro, E.: Innovations in parallel corpus search tools. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3172–3178 (2014)
Metadaten
Titel
Keyword-Based Search on Bilingual Digital Libraries
verfasst von
Ranka Stanković
Cvetana Krstev
Duško Vitas
Nikola Vulović
Olivera Kitanović
Copyright-Jahr
2017
Verlag
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-319-53640-8_10

Neuer Inhalt