Skip to main content

2015 | OriginalPaper | Buchkapitel

A New Approach to Syllabification of Words in Gujarati

verfasst von : Harsh Trivedi, Aanal Patel, Prasenjit Majumder

Erschienen in: Mining Intelligence and Knowledge Exploration

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a statistical approach for automatic syllabification of words in Gujarati. Gujarati is a resource poor language and hardly any work for its syllabification has been reported, to the best our knowledge. Specifically, lack of enough training data makes this task difficult to perform. A training corpus of 14 thousand Gujarati words is built and a new approach to syllabification in Gujarati is tested on it. The maximum word and syllable level accuracies achieved are 91.89 % and 98.02 % respectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Assumption verified and corrected by a Gujarati linguist.
 
2
Illegal syllables are the character sequences which do not occur as a syllable in the training data.
 
3
Any reference to prefix and suffix in this paper henceforth would refer to first and last syllable of the word respectively.
 
4
Maximum Probable approach refers to method described in Subsect. 3.1 and Prefix/Suffix approach refers to add-on method described in Subsect. 3.2.
 
Literatur
1.
Zurück zum Zitat Bartlett, S., Kondrak, G., Cherry, C.: On the syllabification of phonemes. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 308–316. Association for Computational Linguistics (2009) Bartlett, S., Kondrak, G., Cherry, C.: On the syllabification of phonemes. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 308–316. Association for Computational Linguistics (2009)
2.
Zurück zum Zitat Dinu, L.P., Niculae, V., Sulea, O.-M.: Romanian syllabication using machine learning. In: Habernal, I., Matoušek, V. (eds.) TSD 2013. LNCS, vol. 8082, pp. 450–456. Springer, Heidelberg (2013) Dinu, L.P., Niculae, V., Sulea, O.-M.: Romanian syllabication using machine learning. In: Habernal, I., Matoušek, V. (eds.) TSD 2013. LNCS, vol. 8082, pp. 450–456. Springer, Heidelberg (2013)
3.
Zurück zum Zitat Goslin, J., Frauenfelder, U.H.: A comparison of theoretical and human syllabification. Lang. Speech 44(4), 409–436 (2001)CrossRef Goslin, J., Frauenfelder, U.H.: A comparison of theoretical and human syllabification. Lang. Speech 44(4), 409–436 (2001)CrossRef
4.
Zurück zum Zitat Hammond, M.: Parsing syllables: Modeling ot computationally. arXiv preprint cmp-lg/9710004 (1997) Hammond, M.: Parsing syllables: Modeling ot computationally. arXiv preprint cmp-lg/9710004 (1997)
5.
Zurück zum Zitat Kahn, D.: Syllable-based generalizations in English phonology, vol. 156. Indiana University Linguistics Club Bloomington (1976) Kahn, D.: Syllable-based generalizations in English phonology, vol. 156. Indiana University Linguistics Club Bloomington (1976)
6.
Zurück zum Zitat Kiraz, G.A., Möbius, B.: Multilingual syllabification using weighted finite-state transducers. In: The Third ESCA/COCOSDA Workshop (ETRW) on Speech Synthesis (1998) Kiraz, G.A., Möbius, B.: Multilingual syllabification using weighted finite-state transducers. In: The Third ESCA/COCOSDA Workshop (ETRW) on Speech Synthesis (1998)
7.
Zurück zum Zitat Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data (2001) Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data (2001)
8.
Zurück zum Zitat Mayer, T.: Toward a totally unsupervised, language-independent method for the syllabification of written texts. In: Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology, pp. 63–71. Association for Computational Linguistics (2010) Mayer, T.: Toward a totally unsupervised, language-independent method for the syllabification of written texts. In: Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology, pp. 63–71. Association for Computational Linguistics (2010)
9.
Zurück zum Zitat Palchowdhury, S., Majumder, P., Pal, D., Bandyopadhyay, A., Mitra, M.: Overview of FIRE 2011. In: Majumder, P., Mitra, M., Bhattacharyya, P., Subramaniam, L.V., Contractor, D., Rosso, P. (eds.) FIRE 2010 and 2011. LNCS, vol. 7536, pp. 1–12. Springer, Heidelberg (2013) CrossRef Palchowdhury, S., Majumder, P., Pal, D., Bandyopadhyay, A., Mitra, M.: Overview of FIRE 2011. In: Majumder, P., Mitra, M., Bhattacharyya, P., Subramaniam, L.V., Contractor, D., Rosso, P. (eds.) FIRE 2010 and 2011. LNCS, vol. 7536, pp. 1–12. Springer, Heidelberg (2013) CrossRef
11.
Zurück zum Zitat Selkirk, E.O.: On the major class features and syllable theory (1984) Selkirk, E.O.: On the major class features and syllable theory (1984)
Metadaten
Titel
A New Approach to Syllabification of Words in Gujarati
verfasst von
Harsh Trivedi
Aanal Patel
Prasenjit Majumder
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-26832-3_59