Abstract
Speaker recognition is a biometric technique that recognizes a speaker's identity. A robust method with a high recognition rate is the goal of researchers in the field. With the rapid development of cloud computing, many complicated tasks in a speaker recognition system, such as machine learning, data processing, and information recording, can be implemented in the cloud. Meanwhile, speaker recognition can serve as a security and privacy service, both of which are extremely important for the popularization of cloud computing. In this paper, we propose a decision-level data fusion method for speaker recognition in a cloud environment. After introducing the basic speech recognition process and describing the composition of a speech recognition system based on decision-level data fusion with Dempster-Shafer evidence theory, we present SROC, a speaker recognition architecture for cloud computing. Experimental evaluation shows that SROC greatly improves the recognition rate and accuracy.
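To make the idea of decision-level fusion with Dempster-Shafer evidence theory concrete, the following is a minimal sketch of Dempster's rule of combination applied to the outputs of two classifiers. It is not the SROC implementation: the speaker labels, the mass values, and the function names `combine` and `decide` are hypothetical and chosen only for illustration.

```python
from itertools import product


def combine(m1, m2):
    """Fuse two mass functions with Dempster's rule of combination.

    Each mass function maps a frozenset of hypotheses (candidate speakers)
    to a belief mass; the masses of one function sum to 1.
    """
    fused = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            # Agreeing evidence supports the intersection of the two sets.
            fused[common] = fused.get(common, 0.0) + mass_a * mass_b
        else:
            # Disjoint sets contribute to the conflict term K.
            conflict += mass_a * mass_b
    if 1.0 - conflict < 1e-12:
        raise ValueError("sources are in total conflict; rule is undefined")
    # Normalize by 1 - K so the fused masses sum to 1 again.
    return {h: m / (1.0 - conflict) for h, m in fused.items()}


def decide(masses):
    """Return the single speaker with the largest fused mass."""
    singletons = {h: m for h, m in masses.items() if len(h) == 1}
    best = max(singletons, key=singletons.get)
    return next(iter(best))


# Hypothetical evidence from two classifiers over speakers A, B, C.
# Mass on the full set THETA expresses each classifier's uncertainty.
THETA = frozenset({"A", "B", "C"})
m_first = {frozenset({"A"}): 0.6, frozenset({"B"}): 0.3, THETA: 0.1}
m_second = {frozenset({"A"}): 0.5, frozenset({"C"}): 0.3, THETA: 0.2}

fused = combine(m_first, m_second)
print(decide(fused), fused)
```

In this toy case both sources lean toward speaker A, so the fused mass for A ends up higher than either source assigned on its own, which is the intuition behind combining decisions rather than raw features.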
Acknowledgments
This work is supported in part by NSF CNS 1457506.
Cite this article
Jiang, N., Qiu, M. & Dai, W. SROC: A Speaker Recognition with Data Decision Level Fusion Method in Cloud Environment. J Sign Process Syst 86, 123–133 (2017). https://doi.org/10.1007/s11265-015-1100-7
DOI: https://doi.org/10.1007/s11265-015-1100-7