Skip to main content
Erschienen in: International Journal of Speech Technology 3/2018

16.07.2018

Agricultural price information acquisition using noise-robust Mandarin auto speech recognition

verfasst von: Jinpu Xu, Yeping Zhu, Pengmin Xu, Dexin Ma

Erschienen in: International Journal of Speech Technology | Ausgabe 3/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Speech-recognition technology was applied to collect agricultural-price information. In this paper, we propose a robust continuous Mandarin speech-recognition method suitable for environments where agricultural product prices are acquired. To mitigate the decrease in recognition rate caused by the mismatch between training and real tests, we developed acoustic models based on the Hidden Markov Model (HMM) and trained the models by collecting data in different environments. The results showed that the recognition performance of triphone models was superior to that of monophone models. Both male and female HMMs performed better than the male and female mixed acoustic models. Although the decision-tree clustering method could not significantly improve the recognition rate, it evidently would reduce the quantity of triphone models. Gaussian mixture components improved the recognition rate on one hand, but they increased the calculation tasks on the other hand. The cepstral mean normalization and cepstral variance normalization methods significantly improved the identification-system performance. Under different locations and different speaker tests, the methods we used demonstrated varying degrees of improvement in recognition performance. The ultimate recognition rates were 95.04% for the males and 97.62% for the females. Speech-recognition technology can possibly be applied to collection of agricultural-price information. The experimental results showed that the models trained by these methods exhibited good recognition performance. Furthermore, the approach adopted by our research lays the foundation for the development of an application system in the future.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Cao, Y., Teng, G., Yu, L., & Li, Q. (2014). Comparison of different de-noising methods in vocalization environment of laying hens including fan noise. Nongye Gongcheng Xuebao/transactions of the Chinese Society of Agricultural Engineering, 30(2), 212–218. (in Chinese). Cao, Y., Teng, G., Yu, L., & Li, Q. (2014). Comparison of different de-noising methods in vocalization environment of laying hens including fan noise. Nongye Gongcheng Xuebao/transactions of the Chinese Society of Agricultural Engineering, 30(2), 212–218. (in Chinese).
Zurück zum Zitat Dai, J. G., Wang, K. R., Li, S. K., Li, S. M., & Wang, Q. (2012). Designing and implementation of crop production management information system based on state-operated farm. Scientia Agricultura Sinica, 45(11), 2159–2167. (in Chinese). Dai, J. G., Wang, K. R., Li, S. K., Li, S. M., & Wang, Q. (2012). Designing and implementation of crop production management information system based on state-operated farm. Scientia Agricultura Sinica, 45(11), 2159–2167. (in Chinese).
Zurück zum Zitat Gao, S., Bo, X. U., & Huang, T. (2000). Ttiphone models for mandarin speech recognition based on decision tree. Acta Acustica, 25(06), 504–509. (in Chinese). Gao, S., Bo, X. U., & Huang, T. (2000). Ttiphone models for mandarin speech recognition based on decision tree. Acta Acustica, 25(06), 504–509. (in Chinese).
Zurück zum Zitat Gurpreet, S., Akhil, J., & Priyam, C. (2010). Multi utility E-controlled cum voice operated farm vehicle. International Journal of Computer Applications, 1(13), 109–113. Gurpreet, S., Akhil, J., & Priyam, C. (2010). Multi utility E-controlled cum voice operated farm vehicle. International Journal of Computer Applications, 1(13), 109–113.
Zurück zum Zitat Li, C., & Wang, Z. Y. (2003). A new acoustic modeling of inter-syllable context-dependent units for Putonghua continuous speech recognition. Acta Acustica, 28(2), 187–191. (in Chinese). Li, C., & Wang, Z. Y. (2003). A new acoustic modeling of inter-syllable context-dependent units for Putonghua continuous speech recognition. Acta Acustica, 28(2), 187–191. (in Chinese).
Zurück zum Zitat Li, J., Fang, Z., & Zhang, J. (2004). Context dependent initial/final acoustic modeling for continuous Chinese speech recognition. Journal of Tsinghua University, 44(1), 61–64. (in Chinese). Li, J., Fang, Z., & Zhang, J. (2004). Context dependent initial/final acoustic modeling for continuous Chinese speech recognition. Journal of Tsinghua University, 44(1), 61–64. (in Chinese).
Zurück zum Zitat Li, Y. G., Pu, F. A., & Zheng, F. (2012). Statistical threshholding for robust ASR. Journal of ChongQing University of Posts and Telecommunications: Natural Science Edition, 24(02), 127–132. Li, Y. G., Pu, F. A., & Zheng, F. (2012). Statistical threshholding for robust ASR. Journal of ChongQing University of Posts and Telecommunications: Natural Science Edition, 24(02), 127–132.
Zurück zum Zitat Mantena, G. V., Rajendran, S., Rambabu, B., & Gangashetty, S. V. (2011). A speech-based conversation system for accessing agriculture commodity prices in Indian languages. The Workshop on Hands-Free Speech Communication & Microphone Arrays, IEEE. pp. 153–154. Mantena, G. V., Rajendran, S., Rambabu, B., & Gangashetty, S. V. (2011). A speech-based conversation system for accessing agriculture commodity prices in Indian languages. The Workshop on Hands-Free Speech Communication & Microphone Arrays, IEEE. pp. 153–154.
Zurück zum Zitat Ni, C. J., Liu, W. J., & Xu, B. (2009). Research on large vocabulary continuous speech recognition for mandarin Chinese. Journal of Chinese Information Processing, 23(01), 112–123. (in Chinese). Ni, C. J., Liu, W. J., & Xu, B. (2009). Research on large vocabulary continuous speech recognition for mandarin Chinese. Journal of Chinese Information Processing, 23(01), 112–123. (in Chinese).
Zurück zum Zitat Ou, W., Gao, W., Li, Z., & Zhang, S. (2010). Application of keywords speech recognition in agricultural voice information system. 2nd International Conference on Computational Intelligence and Natural Computing Proceedings. Vol. 2, pp. 197–200. Ou, W., Gao, W., Li, Z., & Zhang, S. (2010). Application of keywords speech recognition in agricultural voice information system. 2nd International Conference on Computational Intelligence and Natural Computing Proceedings. Vol. 2, pp. 197–200.
Zurück zum Zitat Plauche, M., Nallasamy, U., Pal, J., Wooters, C., & Ramachandran, D. (2007). Speech Recognition for Illiterate Access to Information and Technology. International Conference on Information and Communication Technologies and Development. Vol. 67, pp. 83–92. Plauche, M., Nallasamy, U., Pal, J., Wooters, C., & Ramachandran, D. (2007). Speech Recognition for Illiterate Access to Information and Technology. International Conference on Information and Communication Technologies and Development. Vol. 67, pp. 83–92.
Zurück zum Zitat Qi, Y. H., Pan, F. P., Ge, F. P., & Yan, Y. H. (2013). Refining triphone model in mandarin continuous speech recognition. Application Research of Computers, 30(10), 2920–2922. (in Chinese). Qi, Y. H., Pan, F. P., Ge, F. P., & Yan, Y. H. (2013). Refining triphone model in mandarin continuous speech recognition. Application Research of Computers, 30(10), 2920–2922. (in Chinese).
Zurück zum Zitat Qian, J. P., Wu, X. M., Fan, B. L., Yang, X. T., Sun, L. X., & Chen, M. (2013). A solution for improving vegetable circulation traceability precision based on barcode-rfid correspondence. Scientia Agricultura Sinica, 46(18), 3857–3863. (in Chinese). Qian, J. P., Wu, X. M., Fan, B. L., Yang, X. T., Sun, L. X., & Chen, M. (2013). A solution for improving vegetable circulation traceability precision based on barcode-rfid correspondence. Scientia Agricultura Sinica, 46(18), 3857–3863. (in Chinese).
Zurück zum Zitat Tian, W. J., Shen, C. J., Zheng, W. G., Zhang, S. R., & Zhou, G. H. (2012). Design and implementation of agricultural products price information acquisition and early warning system. Computer Engineering and Design, 33(5), 1816–1821. (in Chinese). Tian, W. J., Shen, C. J., Zheng, W. G., Zhang, S. R., & Zhou, G. H. (2012). Design and implementation of agricultural products price information acquisition and early warning system. Computer Engineering and Design, 33(5), 1816–1821. (in Chinese).
Zurück zum Zitat Xiao, Y. P., & Ye, W. P. (2010). Survey of feature normalization techniques for robust speech recognition. Journal of Chinese Information Processing, 24(05), 106–116. (in Chinese). Xiao, Y. P., & Ye, W. P. (2010). Survey of feature normalization techniques for robust speech recognition. Journal of Chinese Information Processing, 24(05), 106–116. (in Chinese).
Zurück zum Zitat Xu, S. W., Zhang, Y. E., LI, Z. Q., Li, Z. M., & Kong, F. T. (2011). Research on standard and classification coding system of holographic information of agricultural products market. Food and Nutrition in China, 17(12), 5–8. (in Chinese). Xu, S. W., Zhang, Y. E., LI, Z. Q., Li, Z. M., & Kong, F. T. (2011). Research on standard and classification coding system of holographic information of agricultural products market. Food and Nutrition in China, 17(12), 5–8. (in Chinese).
Zurück zum Zitat Xu, X. H., Zhu, J., & Guo, Q. (2004). A Hierarchical clustering algorithm in continuous mandarin speech recognition. Signal Processing, 20(05), 497–500. (in Chinese). Xu, X. H., Zhu, J., & Guo, Q. (2004). A Hierarchical clustering algorithm in continuous mandarin speech recognition. Signal Processing, 20(05), 497–500. (in Chinese).
Zurück zum Zitat Yao, X., Luo, M., & Yang, G. Q. (2012). Research and design of pen-based interaction agricultural information collection and dissemination system. Computer and Mordenlization, 4, 71–75. (in Chinese). Yao, X., Luo, M., & Yang, G. Q. (2012). Research and design of pen-based interaction agricultural information collection and dissemination system. Computer and Mordenlization, 4, 71–75. (in Chinese).
Zurück zum Zitat Zhang, S. R., Zheng, W. G., Shen, C. J., & Xing, Z. (2012). Agricultural product price information collection terminal of embedded portable wireless. Computer Engineering and Design, 33(2), 514–518. (in Chinese). Zhang, S. R., Zheng, W. G., Shen, C. J., & Xing, Z. (2012). Agricultural product price information collection terminal of embedded portable wireless. Computer Engineering and Design, 33(2), 514–518. (in Chinese).
Zurück zum Zitat Zhao, C. J., Shen, C. J., Xing, Z., Zheng, W. G., Bao, F., & Wu, W. B. (2011). A device and method of agricultural product information acquisition. China Patent, CN102122430A. Zhao, C. J., Shen, C. J., Xing, Z., Zheng, W. G., Bao, F., & Wu, W. B. (2011). A device and method of agricultural product information acquisition. China Patent, CN102122430A.
Zurück zum Zitat Zhao, L., Wang, H., Zhan, Z., & Kong, X. (2008). Research advances in insect acoustic signals and their applications. Plant Protection, 34(4), 5–12. (in Chinese). Zhao, L., Wang, H., Zhan, Z., & Kong, X. (2008). Research advances in insect acoustic signals and their applications. Plant Protection, 34(4), 5–12. (in Chinese).
Zurück zum Zitat Zhu, L. Q., & Zhang, Z. (2012). Automatic recognition of insect sounds using mfcc and gmm. Acta Entomologica Sinica, 55(4), 466–471. (in Chinese). Zhu, L. Q., & Zhang, Z. (2012). Automatic recognition of insect sounds using mfcc and gmm. Acta Entomologica Sinica, 55(4), 466–471. (in Chinese).
Metadaten
Titel
Agricultural price information acquisition using noise-robust Mandarin auto speech recognition
verfasst von
Jinpu Xu
Yeping Zhu
Pengmin Xu
Dexin Ma
Publikationsdatum
16.07.2018
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 3/2018
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-018-9532-7

Weitere Artikel der Ausgabe 3/2018

International Journal of Speech Technology 3/2018 Zur Ausgabe

Neuer Inhalt