Weitere Kapitel dieses Buchs durch Wischen aufrufen
The classical linear discriminant analysis (LDA) may perform poorly in multi-class classification with high-dimensional data. We propose a partially supervised sparse factor regression (PSFAR) approach, to jointly explore the potential low-dimensional structures in the high-dimensional class mean vectors and the common covariance matrix required in LDA. The problem is formulated as a multivariate regression analysis, with predictors constructed from the class labels and responses from the high-dimensional features. The regression coefficient matrix is then composed of the class means, for which we explore a sparse and low rank structure; we further explore a parsimonious factor analysis representation in the covariance matrix. As such, our model assumes that the high-dimensional features are best separated in their means in a low-dimensional subspace, subject to a few unobserved latent factors. We propose a regularized log-likelihood criterion for model estimation, for which an efficient Expectation-Maximization algorithm is developed. The efficacy of PSFAR is demonstrated by both simulation studies and a real application using handwritten digit data.
Bitte loggen Sie sich ein, um Zugang zu diesem Inhalt zu erhalten
Sie möchten Zugang zu diesem Inhalt erhalten? Dann informieren Sie sich jetzt über unsere Produkte:
Fisher, R. A. (1936) The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179–188. CrossRef
Friedman, J., Hastie, T. and Tibshirani, R. (2001) The elements of statistical learning. Springer Series in Statistics Springer, Berlin. MATH
Friedman, J., Hastie, T. and Tibshirani, R. (2010) Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33, 1. CrossRef
Glahn, H. R. (1968) Canonical correlation and its relationship to discriminant analysis and multiple regression. Journal of the Atmospheric Sciences, 25, 23–31. CrossRef
LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998) Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278–2324.
Li, T., Zhu, S. and Ogihara, M. (2006) Using discriminant analysis for multi-class classification: an experimental investigation. Knowledge and Information Systems, 10, 453–472. CrossRef
Lorena, A. C., Carvalho, A. C. P. L. F. and Gama, J. M. P. (2009) A review on the combination of binary classifiers in multiclass problems. Artificial Intelligence Review, 30, 19–37. CrossRef
McLachlan, G. (2004) Discriminant analysis and statistical pattern recognition, vol. 544. John Wiley & Sons.
Vapnik, V. N. and Vapnik, V. (1998) Statistical learning theory. Wiley New York. MATH
Zhang, M.-L. and Zhou, Z.-H. (2014) A review on multi-label learning algorithms. Knowledge and Data Engineering, IEEE Transactions on, 26, 1819–1837. CrossRef
- Partially Supervised Sparse Factor Regression For Multi-Class Classification
Neuer Inhalt/© ITandMEDIA