Skip to main content

2017 | OriginalPaper | Buchkapitel

Shifting Complexity from Text to Data Model

Adding Machine-Oriented Features to a Human-Oriented Terminology Resource

verfasst von : Karolina Suchowolec, Christian Lang, Roman Schneider, Horst Schwinn

Erschienen in: Language, Data, and Knowledge

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Grammis is a web-based information system on German grammar, hosted by the Institute for the German Language (IDS). It is human-oriented and features different theoretical perspectives on grammar. Currently, the terminology component of grammis is being redesigned for this theoretical diversity to play a more prominent role in the data model. This also opens opportunities for implementing some machine-oriented features. In this paper, we present the re-design of both data model and knowledge base. We explore how the addition of machine-oriented features to the data model impacts the knowledge base; in particular, how this addition shifts some of the textual complexity into the data model. We show that our resource can easily be ported to a SKOS-XL representation, which makes it available for data science, knowledge-based NLP applications, and LOD in the context of digital humanities.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
12
Kommunikanten-Pronomen refers to the speaker or the person being addressed (I, you, we), whereas anaphorisches Personalpronomen refers to third parties other than speaker or the person being addressed (he, she, it, they).
 
13
Duden. Die Grammatik is a standard reference for German grammar [12].
 
14
Compare: Possessiv-Artikel vs. possessivisches Artikelwort (‘possessive article word’); Demonstrativ-Artikel vs. demonstrativisches Artikelwort (‘demonstrative article word’).
 
Literatur
1.
Zurück zum Zitat Bubenhofer, N., Schneider, R.: Using a domain ontology for the semantic-statistical classification of specialist hypertexts. In: Papers from the Annual International Conference on Computational Linguistics “Dialogue”, Moscow, 26 May 2010/30 May 2010, pp. 622–628 (2010) Bubenhofer, N., Schneider, R.: Using a domain ontology for the semantic-statistical classification of specialist hypertexts. In: Papers from the Annual International Conference on Computational Linguistics “Dialogue”, Moscow, 26 May 2010/30 May 2010, pp. 622–628 (2010)
2.
Zurück zum Zitat Chiarcos, C., Fäth, C., Renner-Westermann, H., Abromeit, F., Dimitrova, V.: Lin\(|\)gu\(|\)is\(|\)tik: building the linguist’s pathway to bibliographies, libraries, language resources and Linked Open Data. In: Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. (eds.) Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation, Portorož, Slovenia, 23 May 2016/28 May 2016 (2016) Chiarcos, C., Fäth, C., Renner-Westermann, H., Abromeit, F., Dimitrova, V.: Lin\(|\)gu\(|\)is\(|\)tik: building the linguist’s pathway to bibliographies, libraries, language resources and Linked Open Data. In: Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. (eds.) Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation, Portorož, Slovenia, 23 May 2016/28 May 2016 (2016)
3.
Zurück zum Zitat Deutscher Terminologie-Tag e.V.: Terminologiearbeit-Best Practices, 2.0 edn. DTT, Köln, Ordner (2014) Deutscher Terminologie-Tag e.V.: Terminologiearbeit-Best Practices, 2.0 edn. DTT, Köln, Ordner (2014)
4.
Zurück zum Zitat Faber, P.: Frames as framework for terminology. In: Kockaert, H.J., Steurs, F. (eds.) Handbook of Terminology, pp. 14–33. John Benjamins Publishing Company, Amsterdam/Philadelphia (2015) Faber, P.: Frames as framework for terminology. In: Kockaert, H.J., Steurs, F. (eds.) Handbook of Terminology, pp. 14–33. John Benjamins Publishing Company, Amsterdam/Philadelphia (2015)
5.
Zurück zum Zitat Faber, P., Martínez, S.M., Prieto, M.R.C., Ruiz, J.S., Velasco, J.A.P., León-Araúz, P., Linares, C.M., Expósito, M.V.: Process-oriented terminology management in the domain of coastal engineering. Terminology 12(2), 189–213 (2006)CrossRef Faber, P., Martínez, S.M., Prieto, M.R.C., Ruiz, J.S., Velasco, J.A.P., León-Araúz, P., Linares, C.M., Expósito, M.V.: Process-oriented terminology management in the domain of coastal engineering. Terminology 12(2), 189–213 (2006)CrossRef
6.
Zurück zum Zitat Huijsen, W.O.: Controlled language–an introduction. In: Proceedings of the Second International Workshop on Controlled Language Application, CLAW 1998, Pittsburgh, Pennsylvania, 21 May 1998/22 May 1998, pp. 1–15 (1998) Huijsen, W.O.: Controlled language–an introduction. In: Proceedings of the Second International Workshop on Controlled Language Application, CLAW 1998, Pittsburgh, Pennsylvania, 21 May 1998/22 May 1998, pp. 1–15 (1998)
7.
Zurück zum Zitat León Araúz, P., Magaña Redondo, P.J.: Ecolexicon: contextualizing an environmental ontology. In: Proceedings of the Terminology and Knowledge Engineering (TKE) Conference, pp. 341–355 (2010) León Araúz, P., Magaña Redondo, P.J.: Ecolexicon: contextualizing an environmental ontology. In: Proceedings of the Terminology and Knowledge Engineering (TKE) Conference, pp. 341–355 (2010)
8.
Zurück zum Zitat Pareja-Lora, A., Brümmer, M., Chiarcos, C.: General introduction to Open Data, Linked Data, Linked Open Data, and Linked Open Data in linguistics. In: Workshop on Development of Linguistic Linked Open Data (LLOD). Resources for Collaborative Data-intensive Research in the Language Sciences, Chicago, 25 July 2015/26 July 2015. Presentation Slides. LSA Summer Institute (2015) Pareja-Lora, A., Brümmer, M., Chiarcos, C.: General introduction to Open Data, Linked Data, Linked Open Data, and Linked Open Data in linguistics. In: Workshop on Development of Linguistic Linked Open Data (LLOD). Resources for Collaborative Data-intensive Research in the Language Sciences, Chicago, 25 July 2015/26 July 2015. Presentation Slides. LSA Summer Institute (2015)
9.
Zurück zum Zitat Suchowolec, K., Lang, C., Schneider, R.: Re-designing online terminology resources for German grammar. In: Mayr, P., Tudhope, D., Golub, K., Wartena, C., Luca, E.W.D. (eds.) Proceedings of the 15th European Networked Knowledge Organization Systems Workshop (NKOS 2016), Hannover, 09 September 2016, pp. 59–63 (2016) Suchowolec, K., Lang, C., Schneider, R.: Re-designing online terminology resources for German grammar. In: Mayr, P., Tudhope, D., Golub, K., Wartena, C., Luca, E.W.D. (eds.) Proceedings of the 15th European Networked Knowledge Organization Systems Workshop (NKOS 2016), Hannover, 09 September 2016, pp. 59–63 (2016)
10.
Zurück zum Zitat Suchowolec, K., Lang, C., Schneider, R.: Grammar and its terminology. Re-designing terminology management system according to best practices (forthcoming) Suchowolec, K., Lang, C., Schneider, R.: Grammar and its terminology. Re-designing terminology management system according to best practices (forthcoming)
11.
Zurück zum Zitat Temmerman, R.: Towards New Ways of Terminology Description. The Sociocognitive Approach. John Benjamins Publishing Company, Amsterdam/Philadelphia (2000)CrossRef Temmerman, R.: Towards New Ways of Terminology Description. The Sociocognitive Approach. John Benjamins Publishing Company, Amsterdam/Philadelphia (2000)CrossRef
12.
Zurück zum Zitat Wöllstein, A., Dudenredaktion (eds.): Duden. Die Grammatik, 9th edn. Dudenverlag, Berlin (2016) Wöllstein, A., Dudenredaktion (eds.): Duden. Die Grammatik, 9th edn. Dudenverlag, Berlin (2016)
Metadaten
Titel
Shifting Complexity from Text to Data Model
verfasst von
Karolina Suchowolec
Christian Lang
Roman Schneider
Horst Schwinn
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-59888-8_18

Premium Partner