Skip to main content

2017 | OriginalPaper | Buchkapitel

4. Substring-Based Algorithms

verfasst von : Wojciech Wieczorek

Erschienen in: Grammatical Inference

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter discusses the selected algorithms for induction of automata and grammars in which words comparison for searching their fragments that are changed or fixed, plays the crucial role. The following two approaches are presented: Error-Correcting Grammatical Inference (EGCI) and Alignment-Based Learning (ABL). The former approach incrementally grows an inferred automaton by explicitly minimizing (with the help of dynamic programming) the number of states added when each new word is presented. The latter approach, ABL, learns structure from plain sequences by comparing them. Based on the parts of words that are the same and the parts that are not the same in two words, the structure is inserted into a hypothesis space.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Informally, the Levenshtein distance between two words is the minimum number of single-character edits (i.e. insertions, deletions or substitutions) required to change one word into the other.
 
2
Let \(G = (\{A_0, A_1, \ldots , A_m\}, \varSigma , P, A_0)\) be a CFG. A grammar G is said to be non-circular if the right-hand side of every \(A_i\)’s rule does not contain a symbol \(A_j\), where \(j \le i\). The non-circularity of a grammar guarantees that the language accepted by the grammar is finite.
 
Literatur
Zurück zum Zitat Bod R (2006) An all-subtrees approach to unsupervised parsing. In: Proceedings of the 21st international conference on computational linguistics and 44th annual meeting of the ACL, association for computational linguistics, pp 865–872 Bod R (2006) An all-subtrees approach to unsupervised parsing. In: Proceedings of the 21st international conference on computational linguistics and 44th annual meeting of the ACL, association for computational linguistics, pp 865–872
Zurück zum Zitat Rulot H, Vidal E (1987) Modelling (sub)string-length based constraints through a grammatical inference method. In: Kittler J, Devijver P (eds) Proceedings of the NATO advanced study institute on pattern recognition theory and applications. Springer, pp 451–459 Rulot H, Vidal E (1987) Modelling (sub)string-length based constraints through a grammatical inference method. In: Kittler J, Devijver P (eds) Proceedings of the NATO advanced study institute on pattern recognition theory and applications. Springer, pp 451–459
Zurück zum Zitat Solan Z, Horn D, Ruppin E, Edelman S (2005) Unsupervised learning of natural languages. Proc Nat Acad Sci USA 102(33):11,629–11,634 Solan Z, Horn D, Ruppin E, Edelman S (2005) Unsupervised learning of natural languages. Proc Nat Acad Sci USA 102(33):11,629–11,634
Zurück zum Zitat van Zaanen M (2000) ABL: alignment-based learning. In:Proceedings of the 18th international conference on computational linguistics (COLING), association for computational linguistics, association for computational linguistics, pp 961–967 van Zaanen M (2000) ABL: alignment-based learning. In:Proceedings of the 18th international conference on computational linguistics (COLING), association for computational linguistics, association for computational linguistics, pp 961–967
Metadaten
Titel
Substring-Based Algorithms
verfasst von
Wojciech Wieczorek
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-46801-3_4