Skip to main content
Erschienen in: Knowledge and Information Systems 9/2020

10.02.2020 | Regular Paper

PragmaticOIE: a pragmatic open information extraction for Portuguese language

verfasst von: Cleiton Fernando Lima Sena, Daniela Barreiro Claro

Erschienen in: Knowledge and Information Systems | Ausgabe 9/2020

Einloggen

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Information extraction (IE) involves the extraction of useful facts from texts. IE approaches have been categorized into two types: Traditional IE and Open IE. Traditional IE recognizes a predefined set of relationships between the arguments, and it has typically been applied to specific domains. Open IE extracts relationship descriptors expressing any semantic relationship between a pair of arguments in different domains. Although a sentence can have a different meaning, given the context and intention used, a single semantic analysis does not guarantee useful extractions. Extractions depend on the context and the intention inherited in a sentence that goes beyond the semantic meaning. Thus, a pragmatic analysis enhances the set of extractions by considering the contextual and intentional aspects. As a consequence, new facts can be extracted from this set of sentences. The combination of inference, context, and intention enables the extraction of implicit facts from texts achieving a first pragmatic level. This novel approach increases the number of facts, extracting relationships from a sentence analyzing inference, context, and intention. This is the first method to analyze a first pragmatic level from a sentence within a set of Portuguese text documents. Our method was performed over a set of Portuguese text documents and outperforms the most relevant related work comparing accuracy, number of extracted facts, and minimality measures.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Akbik A, Löser A (2012) Kraken: N-ary facts in open information extraction. In: Proceedings of the AKBC-WEKEX, ACL, pp 52–56 Akbik A, Löser A (2012) Kraken: N-ary facts in open information extraction. In: Proceedings of the AKBC-WEKEX, ACL, pp 52–56
2.
Zurück zum Zitat Banko M, Cafarella MJ, Soderland S, Broadhead M, Etzioni O (2007) Open information extraction for the web. Proc IJCAI 7:2670–2676 Banko M, Cafarella MJ, Soderland S, Broadhead M, Etzioni O (2007) Open information extraction for the web. Proc IJCAI 7:2670–2676
3.
Zurück zum Zitat Banko M, Etzioni O, Center T (2008) The tradeoffs between open and traditional relation extraction. In: Proceedings of the ACL, ACL, vol 8, pp 28–36 Banko M, Etzioni O, Center T (2008) The tradeoffs between open and traditional relation extraction. In: Proceedings of the ACL, ACL, vol 8, pp 28–36
4.
Zurück zum Zitat Bast H, Haussmann E (2013) Open information extraction via contextual sentence decomposition. In: Proceedings of the ICSC, IEEE, pp 154–159 Bast H, Haussmann E (2013) Open information extraction via contextual sentence decomposition. In: Proceedings of the ICSC, IEEE, pp 154–159
5.
Zurück zum Zitat Bast H, Haussmann E (2014) More informative open information extraction via simple inference. In: Proceedings of the ECIR, Springer, pp 585–590 Bast H, Haussmann E (2014) More informative open information extraction via simple inference. In: Proceedings of the ECIR, Springer, pp 585–590
6.
Zurück zum Zitat Blühdorn H (1997) A relação entre pragmática, semântica e gramática. Rev Estud Ling 6(2):150–188 Blühdorn H (1997) A relação entre pragmática, semântica e gramática. Rev Estud Ling 6(2):150–188
7.
Zurück zum Zitat Colen W, Finger M (2013) Improving CoGrOO: the Brazilian Portuguese grammar checker. In: Proceedings of the STIL, pp 21–29 Colen W, Finger M (2013) Improving CoGrOO: the Brazilian Portuguese grammar checker. In: Proceedings of the STIL, pp 21–29
8.
Zurück zum Zitat da Costa JC (2009) A teoria inferencial das implicaturas: descrição do modelo clássico de grice. Let Hoje 44(3):12–17 da Costa JC (2009) A teoria inferencial das implicaturas: descrição do modelo clássico de grice. Let Hoje 44(3):12–17
9.
Zurück zum Zitat de Oliveira LS, Glauber R, Claro DB (2017) Dependentie: an open information extraction system on Portuguese by a dependence analysis. In: Proceedings of ENIAC, FC-UFU, pp 271–282 de Oliveira LS, Glauber R, Claro DB (2017) Dependentie: an open information extraction system on Portuguese by a dependence analysis. In: Proceedings of ENIAC, FC-UFU, pp 271–282
10.
Zurück zum Zitat Del Corro L, Gemulla R (2013) Clausie: clause-based open information extraction. In: Proceedings of the WWW, ACM, pp 355–366 Del Corro L, Gemulla R (2013) Clausie: clause-based open information extraction. In: Proceedings of the WWW, ACM, pp 355–366
11.
Zurück zum Zitat Etzioni O, Banko M, Soderland S, Weld DS (2008) Open information extraction from the web. Commun ACM 51(12):68–74CrossRef Etzioni O, Banko M, Soderland S, Weld DS (2008) Open information extraction from the web. Commun ACM 51(12):68–74CrossRef
12.
Zurück zum Zitat Fader A, Soderland S, Etzioni O (2011) Identifying relations for open information extraction. In: Proceedings of the EMNLP, ACL, pp 1535–1545 Fader A, Soderland S, Etzioni O (2011) Identifying relations for open information extraction. In: Proceedings of the EMNLP, ACL, pp 1535–1545
13.
Zurück zum Zitat Faruqui M, Kumar S (2015) Multilingual open relation extraction using cross-lingual projection. In: Proceedings of the NAACL HLT, ACL, pp 1351–1356 Faruqui M, Kumar S (2015) Multilingual open relation extraction using cross-lingual projection. In: Proceedings of the NAACL HLT, ACL, pp 1351–1356
14.
Zurück zum Zitat Gamallo P, Garcia M (2015) Multilingual open information extraction. In: Proceedings of the EPIA, Springer, pp 711–722 Gamallo P, Garcia M (2015) Multilingual open information extraction. In: Proceedings of the EPIA, Springer, pp 711–722
15.
Zurück zum Zitat Gamallo P, Garcia M, Fernández-Lanza S (2012) Dependency-based open information extraction. In: Proceedings of the ROBUS-UNSUP, ACL, pp 10–18 Gamallo P, Garcia M, Fernández-Lanza S (2012) Dependency-based open information extraction. In: Proceedings of the ROBUS-UNSUP, ACL, pp 10–18
16.
Zurück zum Zitat Glauber R, Claro DB (2018) A systematic mapping study on open information extraction. Expert Syst Appl 112:372–387CrossRef Glauber R, Claro DB (2018) A systematic mapping study on open information extraction. Expert Syst Appl 112:372–387CrossRef
17.
Zurück zum Zitat Godoy L (2009) Os verbos recíprocos no pb e a hipótese da determinação semântico-lexical sobre a sintaxe. Rev Ling 53(1):283–299 Godoy L (2009) Os verbos recíprocos no pb e a hipótese da determinação semântico-lexical sobre a sintaxe. Rev Ling 53(1):283–299
18.
Zurück zum Zitat Grice HP (1989) Studies in the way of words. Harvard University Press, Cambridge Grice HP (1989) Studies in the way of words. Harvard University Press, Cambridge
19.
Zurück zum Zitat Hoang TBN, Mothe J (2018) Location extraction from tweets. Inf Process Manag 54(2):129–144CrossRef Hoang TBN, Mothe J (2018) Location extraction from tweets. Inf Process Manag 54(2):129–144CrossRef
20.
Zurück zum Zitat Leão LBC (2014) Implicaturas e a violação das máximas conversacionais: uma análise do humor em tirinhas. Work Pap Ling 14(1):65–79CrossRef Leão LBC (2014) Implicaturas e a violação das máximas conversacionais: uma análise do humor em tirinhas. Work Pap Ling 14(1):65–79CrossRef
21.
Zurück zum Zitat Liu S, Ren F (2011) Paragraph act based pragmatic information extraction in question answering. In: Proceedings of the CCIS, IEEE, pp 153–157 Liu S, Ren F (2011) Paragraph act based pragmatic information extraction in question answering. In: Proceedings of the CCIS, IEEE, pp 153–157
22.
Zurück zum Zitat Schmitz Mausam M, Schmitz M, Bart R, Soderland S, Etzioni O (2012) Open language learning for information extraction. In: Proceedings of the EMNLP–CoNLL, ACL, pp 523–534 Schmitz Mausam M, Schmitz M, Bart R, Soderland S, Etzioni O (2012) Open language learning for information extraction. In: Proceedings of the EMNLP–CoNLL, ACL, pp 523–534
23.
Zurück zum Zitat Mausam M (2016) Open information extraction systems and downstream applications. In: Proceedings of the IJCAI, AAAI Press, pp 4074–4077 Mausam M (2016) Open information extraction systems and downstream applications. In: Proceedings of the IJCAI, AAAI Press, pp 4074–4077
24.
Zurück zum Zitat Morris CW (1938) Foundations of the theory of signs, vol 1. University of Chicago Press, Chicago Morris CW (1938) Foundations of the theory of signs, vol 1. University of Chicago Press, Chicago
25.
Zurück zum Zitat Nazário MdL (2011) Estudo pragmático: a teoria da relevância no processo comunicativo. REVELLI (Rev Educ Ling Lit UEG Inhumas) 3(2):56–67 Nazário MdL (2011) Estudo pragmático: a teoria da relevância no processo comunicativo. REVELLI (Rev Educ Ling Lit UEG Inhumas) 3(2):56–67
26.
Zurück zum Zitat Nebot V, Berlanga R (2014) Exploiting semantic annotations for open information extraction: an experience in the biomedical domain. Knowl Inf Syst 38(2):365–389CrossRef Nebot V, Berlanga R (2014) Exploiting semantic annotations for open information extraction: an experience in the biomedical domain. Knowl Inf Syst 38(2):365–389CrossRef
27.
Zurück zum Zitat Nöth W (1995) Panorama Da Semiotica—de Platão a Percie, vol 3. Annablume, São Paulo Nöth W (1995) Panorama Da Semiotica—de Platão a Percie, vol 3. Annablume, São Paulo
28.
Zurück zum Zitat Sena CFL, Claro DB (2018) Inferportoie: a Portuguese open information extraction system with inferences. Nat Lang Eng 1:1–20 Sena CFL, Claro DB (2018) Inferportoie: a Portuguese open information extraction system with inferences. Nat Lang Eng 1:1–20
29.
Zurück zum Zitat Sena CFL, Glauber R, Claro DB (2017) Inference approach to enhance a Portuguese open information extraction. In: Proceedings of the ICEIS, INSTICC, ScitePress, pp 442–451 Sena CFL, Glauber R, Claro DB (2017) Inference approach to enhance a Portuguese open information extraction. In: Proceedings of the ICEIS, INSTICC, ScitePress, pp 442–451
30.
Zurück zum Zitat Vanbelle S (2018) Asymptotic variability of (multilevel) multirater kappa coefficients. Stat Methods Med Res 28:3012–3026MathSciNetCrossRef Vanbelle S (2018) Asymptotic variability of (multilevel) multirater kappa coefficients. Stat Methods Med Res 28:3012–3026MathSciNetCrossRef
31.
Zurück zum Zitat Vo DT, Bagheri E (2019) Feature-enriched matrix factorization for relation extraction. Inf Process Manag 56(3):424–444CrossRef Vo DT, Bagheri E (2019) Feature-enriched matrix factorization for relation extraction. Inf Process Manag 56(3):424–444CrossRef
32.
Zurück zum Zitat Wu F, Weld DS (2010) Open information extraction using Wikipedia. In: Proceedings of the ACL, ACL, pp 118–127 Wu F, Weld DS (2010) Open information extraction using Wikipedia. In: Proceedings of the ACL, ACL, pp 118–127
33.
Zurück zum Zitat Xavier C, Strube de Lima V, Souza M (2015) Open information extraction based on lexical semantics. J Braz Comput Soc 21:1–14CrossRef Xavier C, Strube de Lima V, Souza M (2015) Open information extraction based on lexical semantics. J Braz Comput Soc 21:1–14CrossRef
Metadaten
Titel
PragmaticOIE: a pragmatic open information extraction for Portuguese language
verfasst von
Cleiton Fernando Lima Sena
Daniela Barreiro Claro
Publikationsdatum
10.02.2020
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 9/2020
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-020-01442-7

Weitere Artikel der Ausgabe 9/2020

Knowledge and Information Systems 9/2020 Zur Ausgabe