Skip to main content
Top

2017 | OriginalPaper | Chapter

The Effect of Morphological Factors on Sentence Boundaries in Russian Spontaneous Speech

Authors : Anton Stepikhov, Anastassia Loukina

Published in: Speech and Computer

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The paper evaluates the contribution of morphological factors to the probability of sentence boundaries in Russian unscripted monologue. The analysis is based on multiple expert manual annotations of unscripted speech which allow obtaining fine-grained estimates of the probability of sentence boundary at each word junction. We used linear regression analysis to explore whether there is a relationship between sentence boundaries marked by the annotators and the grammatical features of the text. We focused on morphological factors related to the presence or absence of sentence boundaries.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Chi-squared statistics was computed without taking into account numerals and interjections.
 
Literature
1.
go back to reference Shriberg, E.: How people really talk and why engineers should care. In: Proceedings of the Interspeech 2005, pp. 1791–1794 (2005) Shriberg, E.: How people really talk and why engineers should care. In: Proceedings of the Interspeech 2005, pp. 1791–1794 (2005)
2.
go back to reference Magimai-Doss, M., Hakkani-Tür, D., Çetin, Ö., Shriberg, E., Fung, J., Mirghafori, N.: Entropy based classifier combination for sentence segmentation. In: Proceedings of the ICASSP 2007, vol. 4, pp. 180–192 (2007) Magimai-Doss, M., Hakkani-Tür, D., Çetin, Ö., Shriberg, E., Fung, J., Mirghafori, N.: Entropy based classifier combination for sentence segmentation. In: Proceedings of the ICASSP 2007, vol. 4, pp. 180–192 (2007)
3.
go back to reference Liu, Y.-F., Tseng, S.-C., Jang, J.-S.R., Chen, C.-H.A.: Coping imbalanced prosodic unit boundary detection with linguistically motivated prosodic features. In: Proceedings of the Interspeech 2010, pp. 1417–1420 (2010) Liu, Y.-F., Tseng, S.-C., Jang, J.-S.R., Chen, C.-H.A.: Coping imbalanced prosodic unit boundary detection with linguistically motivated prosodic features. In: Proceedings of the Interspeech 2010, pp. 1417–1420 (2010)
4.
go back to reference Liu, Y., Stolcke, A., Shriberg, E., Harper, M.: Using conditional random fields for sentence boundary detection in speech. In: Proceedings of the ACL 2005, pp. 451–458 (2005) Liu, Y., Stolcke, A., Shriberg, E., Harper, M.: Using conditional random fields for sentence boundary detection in speech. In: Proceedings of the ACL 2005, pp. 451–458 (2005)
5.
go back to reference Ueffing, N., Bisani, M., Vozila, P.: Improved models for automatic punctuation prediction for spoken and written text. In: Proceedings of the Interspeech 2013, pp. 3097–3101 (2013) Ueffing, N., Bisani, M., Vozila, P.: Improved models for automatic punctuation prediction for spoken and written text. In: Proceedings of the Interspeech 2013, pp. 3097–3101 (2013)
6.
go back to reference Xu, C., Xie, L., Huang, G., Xiao, X., Chng, E.S., Li, H.: A deep neural network approach for sentence boundary detection in broadcast news. In: Proceedings of the Interspeech 2014, pp. 2887–2891 (2014) Xu, C., Xie, L., Huang, G., Xiao, X., Chng, E.S., Li, H.: A deep neural network approach for sentence boundary detection in broadcast news. In: Proceedings of the Interspeech 2014, pp. 2887–2891 (2014)
7.
go back to reference Liu, Y.-F., Tseng, S.-C., Roger Jang, J.-S., Alvin Chen, C.-H.: Coping imbalanced prosodic unit boundary detection with linguistically motivated prosodic features. In: Interspeech 2010, pp. 1417–1420 (2010) Liu, Y.-F., Tseng, S.-C., Roger Jang, J.-S., Alvin Chen, C.-H.: Coping imbalanced prosodic unit boundary detection with linguistically motivated prosodic features. In: Interspeech 2010, pp. 1417–1420 (2010)
8.
go back to reference Chistikov, P., Khomitsevich, O.: Online automatic sentence boundary detection in a Russian ASR system. In: Proceedings of the SPECOM 2011, pp. 112–117 (2011) Chistikov, P., Khomitsevich, O.: Online automatic sentence boundary detection in a Russian ASR system. In: Proceedings of the SPECOM 2011, pp. 112–117 (2011)
9.
go back to reference Momtazi, S., Faubel, F., Klakow, D.: Within and across sentence boundary language model. In: Proceedings of the Interspeech 2010, pp. 1800–1803 (2010) Momtazi, S., Faubel, F., Klakow, D.: Within and across sentence boundary language model. In: Proceedings of the Interspeech 2010, pp. 1800–1803 (2010)
10.
go back to reference Kolář, J., Liu, Y.: Comparing and combining modeling techniques for sentence segmentation of spoken Czech using textual and prosodic information. In: Proceedings of the Speech Prosody 2010, p. 100021:1–4 (2010) Kolář, J., Liu, Y.: Comparing and combining modeling techniques for sentence segmentation of spoken Czech using textual and prosodic information. In: Proceedings of the Speech Prosody 2010, p. 100021:1–4 (2010)
11.
go back to reference Mori, S., Nishimura, M., Itoh, N.: An automatic sentence boundary detector based on a structured language model. In: Proceedings of the ICSLP 2002, pp. 921–924 (2002) Mori, S., Nishimura, M., Itoh, N.: An automatic sentence boundary detector based on a structured language model. In: Proceedings of the ICSLP 2002, pp. 921–924 (2002)
12.
go back to reference Oba, T., Hori, T., Nakamura, A.: Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. In: Proceedings of the Interspeech 2006, pp. 1153–1156 (2006) Oba, T., Hori, T., Nakamura, A.: Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. In: Proceedings of the Interspeech 2006, pp. 1153–1156 (2006)
13.
go back to reference Khomitsevich, O., Chistikov, P., Krivosheeva, T., Epimakhova, N., Chernykh, I.: Combining prosodic and lexical classifiers for two-pass punctuation detection in a Russian ASR system. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 161–169. Springer, Cham (2015). doi:10.1007/978-3-319-23132-7_20 CrossRef Khomitsevich, O., Chistikov, P., Krivosheeva, T., Epimakhova, N., Chernykh, I.: Combining prosodic and lexical classifiers for two-pass punctuation detection in a Russian ASR system. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 161–169. Springer, Cham (2015). doi:10.​1007/​978-3-319-23132-7_​20 CrossRef
14.
go back to reference Stepikhov, A.: Analysis of expert manual annotation of the Russian spontaneous monologue: evidence from sentence boundary detection. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 33–40. Springer, Cham (2013). doi:10.1007/978-3-319-01931-4_5 CrossRef Stepikhov, A.: Analysis of expert manual annotation of the Russian spontaneous monologue: evidence from sentence boundary detection. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 33–40. Springer, Cham (2013). doi:10.​1007/​978-3-319-01931-4_​5 CrossRef
15.
go back to reference Stepikhov, A.: Resolving ambiguities in sentence boundary detection in Russian spontaneous speech. In: Habernal, I., Matoušek, V. (eds.) TSD 2013. LNCS, vol. 8082, pp. 426–433. Springer, Heidelberg (2013). doi:10.1007/978-3-642-40585-3_54 CrossRef Stepikhov, A.: Resolving ambiguities in sentence boundary detection in Russian spontaneous speech. In: Habernal, I., Matoušek, V. (eds.) TSD 2013. LNCS, vol. 8082, pp. 426–433. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-40585-3_​54 CrossRef
16.
go back to reference Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý, P., Suchomel, V.: The Sketch Engine: ten years on. In: Lexicography ASIALEX, vol. 1, pp. 7–36 (2014). https://www.sketchengine.co.uk Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý, P., Suchomel, V.: The Sketch Engine: ten years on. In: Lexicography ASIALEX, vol. 1, pp. 7–36 (2014). https://​www.​sketchengine.​co.​uk
Metadata
Title
The Effect of Morphological Factors on Sentence Boundaries in Russian Spontaneous Speech
Authors
Anton Stepikhov
Anastassia Loukina
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-66429-3_73

Premium Partner