Skip to main content

2019 | OriginalPaper | Buchkapitel

Chinese Dialects Identification Using Attention-Based Deep Neural Networks

verfasst von : Yuanhang Qiu, Yong Ma, Yun Jin, Shidang Li, Mingliang Gu

Erschienen in: Communications, Signal Processing, and Systems

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a novel Chinese dialects identification system. We use attention-based deep neural networks (AB-DNN) to obtain the Chinese dialects model as back-end. The front-end fuses identity vector (i-vector) with the global prosodic information as input used to describe the dialectal category information accurately. In the task, five kinds of Chinese dialects including Min, Yue, Wu, Jianghuai, Zhongyuan and standard Mandarin are selected as the identification objects. Experimental results show that 21.1% relative equal error rate (EER) reduction is obtained compared with regular deep neural networks (DNN) and further 14.5% reduction when apply global fusion features. The method based on AB-DNN combined with global fusion features observes 29.2% performance improvement compared to traditional DNN with MFCC.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Resnick, M.C.: Dialect zones and automatic dialect identification in Latin American Spanish. Hispania 52, 553–568 (1969) Resnick, M.C.: Dialect zones and automatic dialect identification in Latin American Spanish. Hispania 52, 553–568 (1969)
2.
Zurück zum Zitat Mingliang, G.U., Zhaoyong, S.H.E.N.: Phonotatics based Chinese dialects identification. J. Chin. Inf. Proc. 20(5), 77–82 (2006) Mingliang, G.U., Zhaoyong, S.H.E.N.: Phonotatics based Chinese dialects identification. J. Chin. Inf. Proc. 20(5), 77–82 (2006)
3.
Zurück zum Zitat Etman, A., Louis, A.A.: American dialect identification using phonotactic and prosodic features. In: SAI Intelligent Systems Conference, pp. 963–970. IEEE (2015) Etman, A., Louis, A.A.: American dialect identification using phonotactic and prosodic features. In: SAI Intelligent Systems Conference, pp. 963–970. IEEE (2015)
4.
Zurück zum Zitat Zhang, Q., Bořil, H., Hansen, J.H.L.: Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification. In: ICASSP, pp. 7363–7367. IEEE (2013) Zhang, Q., Bořil, H., Hansen, J.H.L.: Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification. In: ICASSP, pp. 7363–7367. IEEE (2013)
5.
Zurück zum Zitat Luong, M.T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. arXiv:1508.04025 (2015) Luong, M.T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. arXiv:​1508.​04025 (2015)
6.
Zurück zum Zitat Chorowski, J.K., Bahdanau, D., Serdyuk, D., et al.: Attention-based models for speech recognition. In: Advances in Neural Information Processing Systems, pp. 577–585 (2015) Chorowski, J.K., Bahdanau, D., Serdyuk, D., et al.: Attention-based models for speech recognition. In: Advances in Neural Information Processing Systems, pp. 577–585 (2015)
7.
Zurück zum Zitat Xu, K., Ba, J., Kiros, R., et al.: Show, attend and tell: neural image caption generation with visual attention. ICML 14, 77–81 (2015) Xu, K., Ba, J., Kiros, R., et al.: Show, attend and tell: neural image caption generation with visual attention. ICML 14, 77–81 (2015)
8.
Zurück zum Zitat Raffel, C., Ellis, D.P.W.: Feed-forward networks with attention can solve some long-term memory problems. arXiv:1512.08756 (2015) Raffel, C., Ellis, D.P.W.: Feed-forward networks with attention can solve some long-term memory problems. arXiv:​1512.​08756 (2015)
9.
Zurück zum Zitat Kenny, P., Boulianne, G., Ouellet, P., et al.: Joint factor analysis versus Eigenchannels in speaker recognition. Trans. Audio Speech Lang. Process. 15(4), 1435–1447 (2007) Kenny, P., Boulianne, G., Ouellet, P., et al.: Joint factor analysis versus Eigenchannels in speaker recognition. Trans. Audio Speech Lang. Process. 15(4), 1435–1447 (2007)
10.
Zurück zum Zitat Dehak, N., Kenny, P., Dehak, R., et al.: Front-end factor analysis for speaker verification. Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011) Dehak, N., Kenny, P., Dehak, R., et al.: Front-end factor analysis for speaker verification. Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)
Metadaten
Titel
Chinese Dialects Identification Using Attention-Based Deep Neural Networks
verfasst von
Yuanhang Qiu
Yong Ma
Yun Jin
Shidang Li
Mingliang Gu
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6571-2_250

Neuer Inhalt