Abstract
Due to some correlative or repetitive factors between features or samples with high dimension and large amount of sample data, when traditional back-propagation (BP) neural network is used to solve this classification problem, it will present a series of problems such as network structural redundancy, low learning efficiency, occupation of storage space, consumption of computing time, and so on. All of these problems will restrict the operating efficiency and classification precision of neural network. To avoid them, partial least squares (PLS) algorithm is used to reduce the feature dimension of original data into low-dimensional data as the input of BP neural network, so that it can simplify the structure and accelerate convergence, thus improving the training speed and operating efficiency. In order to improve the classification precision of BP neural network by using hierarchical cluster analysis (HCA), similar samples are put into a sub-class, and some different sub-classes can be obtained. For each sub-class, a different training session can be conducted to find a corresponding precision BP neural network model, and the simulation samples of different sub-classes can be recognized by the corresponding network model. In this paper, the theories of PLS and HCA are combined together with the property of BP neural network, and an optimized classification algorithm by BP neural network based on PLS and HCA (PLS-HCA-BP algorithm) is proposed. The new algorithm is aimed at improving the operating efficiency and classification precision so as to provide a more reliable and more convenient tool for complex pattern classification systems. Three experiments and comparisons with four other algorithms are carried out to verify the superiority of the proposed algorithm, and the results indicate a good picture of the PLS-HCA-BP algorithm, which is worthy of further promotion.
Similar content being viewed by others
References
Ding SF, Jia WK, Su CY, Zhang LW, Shi ZZ (2008) Neural network research progress and applications in forecast. Lect Notes Comput Sci 5264:783–793
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representation by Back-Propagating errors. Nature 3(6):533–536
Khoshnevisan B, Rafiee S, Omid M (2013) Prognostication of environmental indices in potato production using artificial neural networks. J Clean Prod 52:402–409
Xu LQ, Liu SY (2013) Study of short-term water quality prediction model based on wavelet neural network. Math Comput Model 58(3-4):801–807
Azami H, Mosavi MR, Sanei S (2013) Classification of GPS satellites using improved back propagation training algorithms. Wirel Pers Commun 71(2):789–803
Ma YY, Zhu LP (2013) A review on dimension reduction. Int Stat Rev 81(1):134–150
Jia WK, Ding SF, Xu XZ, Su CY, Shi ZZ (2011) Factor analysis feature extraction algorithm based on shannon entropy. Pattern Recognit Artif Intell 24(3):327–331
Blum MGB, Nunes MA, Prangle D, Sisson SA (2013) A comparative review of dimension reduction methods in approximate bayesian computation. Stat Sci 28(2):189–208
Garimella S, Hermansky H (2013) Factor analysis of auto-associative neural networks with application in speaker verification. IEEE Trans Neural Netw Learn Syst 24(4):522–528
Pradhan B, Lee S (2010) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modeling. Environ Model Softw 25(6):747–759
Chattopadhyay S, Chattopadhyay G (2008) A factor analysis and neural network-based validation of the Varotsos-Cracknell theory on the 11-year solar cycle. Int J Remote Sens 29(9):2775–2786
Boukhatem B, Kenai S, Hamou AT et al (2012) Predicting concrete properties using neural networks (NN) with principal component analysis (PCA) technique. Comput Concr 10(6):557–573
Xiao YQ, He YG (2011) A novel approach for analog fault diagnosis based on neural networks and improved kernel PCA. Neurocomputing 74(7):1102–1115
Bartecki K (2012) Neural network-based PCA: an application to approximation of a distributed parameter system. Lect Notes Artif Intell 7267:3–11
Gopi ES (2007) Digital image forgery detection using artificial neural network and independent component analysis. Appl Math Comput 194(2):540–543
Kuo SC, Lin CJ, Liao JR (2011) 3D reconstruction and face recognition using kernel-based ICA and neural networks. Expert Syst Appl 38(5):5406–5415
Raju KS, Kumar DN (2007) Classification of Indian meteorological stations using cluster and fuzzy cluster analysis, and Kohonen artificial neural networks. Nord Hydrol 38(3):303–314
Garlik B, Krivan M (2013) Identification of type daily diagrams of electric consumption based on cluster analysis of multi-dimensional data by neural network. Neural Netw World 23(3):271–283
Leiva-Valdebenito SA, Torres-Aviles FJ (2010) A review of the most common partition algorithms in cluster analysis: a comparative study. Rev Colomb Estadistica 33(2):321–339
Richard AJ, Dean WW (2007) Applied multivariate statistical analysis In: 6th. Prentice-Hall, Englewood Cliffs
Lin SK, Zhang DY, Li WX, Zhang TC, Zhang YF (2005) Neural network forecasting model based on clustering and principle components analysis. Mini-micro Syst 26(12):2160–2163
Ding SF, Jia WK, Jin FX, Zhang LW, Liu LL (2011) The research of neural network algorithm based on factor analysis and cluster analysis. Neural Comput & Applic 20(2):297–302
Wang HW (2000) Partial least squares regression method and application. National Defense Industry Press, Beijing
Ding SF, Jin FX, Shi ZZ (2005) Information feature compression based on partial least squares. J Comput Aided Des Comput Graph 17(2):368–371
Gan XS, Duanmub JH, Wang JF, Gong W (2013) Anomaly intrusion detection based on PLS feature extraction and core vector machine. Knowl.-Based Syst. 40:1–6
Marques J, Erik D (2011) Texture analysis by a PLS based method for combined feature extraction and selection. Lect Notes Comput Sci 7009:109–116
Janik LJ, Forrester ST, Rawson A (2009) The prediction of soil chemical and physical properties from mid-infrared spectroscopy and combined partial least-squares regression and neural networks (PLS-NN) analysis. Chemometr Intell Lab Syst 97(2):179–188
Ding SF, Jia WK, Xu XZ, Su CY (2010) Elman neural network algorithm based on PLS. Acta Electron Sin 38(2A):71–75
Li RH, Meng GX, Gao NK, Xie HK (2007) Combined use of partial least-squares regression and neural network for residual life estimation of large generator stator insulation. Meas Sci Technol 18(7):2074–2082
Gao DQ (1998) On structures of supervised linear basis function feed forward three-layered neural networks. Chin J Comput 21(1):80–86
Zhang YM (2003) The application of artificial neural network in the forecasting of wheat midge. Northwest A&F University
http://www.ics.uci.edu/~mlearn/databases/ Waveform Database Generator (Version 2)
Wang XY, Garibaldi JM, Bird B, George MW (2007) A novel fuzzy clustering algorithm for the analysis of axillary lymph node tissue sections. Appl Intell 27(3):237–248
Schwenker F, Trentin E (2014) Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recogn Lett 37:4–14
Ghaseminezhad MH, Karami A (2011) A novel self-organizing map (SOM) neural network for discrete groups of data clustering. Appl Soft Comput 11(4):3771–3778
Avci E (2012) An expert target recognition system using a genetic wavelet neural network. Appl Intell 37(4):475–487
Siniscalchi SM, Yu D, Deng L (2013) Exploiting deep neural networks for detection-based speech recognition. Neurocomputing 106:148–157
Matthews SG, Gongora MA, Hopgood AA (2013) Evolutionary algorithms and fuzzy sets for discovering temporal rules. Int J Appl Math Comput Sci 23(4):855–868
Lee SH, Lim JS (2014) Minimum feature selection for epileptic seizure classification using wavelet-based feature extraction and a fuzzy neural network. Appl Math Inform Sci 8(3):1295–1300
Acknowledgments
This work is supported by the National Natural Science Foundation of China (No. 60875052, 61203014, 61379101); Priority Academic Program Development of Jiangsu Higher Education Institutions; Major Projects in the National Science & Technology Pillar Program during the Twelfth Five-year Plan Period (No. 2011BAD20B06); The Specialized Research Fund for the Doctoral Program of Higher Education of China (No. 20133227110024); Ordinary University Graduate Student Research Innovation Projects of Jiangsu Province (No. KYLX 14_1062).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jia, W., Zhao, D., Shen, T. et al. An optimized classification algorithm by BP neural network based on PLS and HCA. Appl Intell 43, 176–191 (2015). https://doi.org/10.1007/s10489-014-0618-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-014-0618-x