ABSTRACT
This paper suggests a method for multiclass learning with many classes by simultaneously learning shared characteristics common to the classes, and predictors for the classes in terms of these characteristics. We cast this as a convex optimization problem, using trace-norm regularization and study gradient-based optimization both for the linear case and the kernelized setting.
- Abernethy, J., Bach, F., Evgeniou, T., & Vert, J.-P. (2006). Low-rank matrix factorization with attributes Technical report N24/06/MM). Ecole des Mines de Paris.Google Scholar
- Argyriou, A., Evgeniou, T., & Pontil, M. (2007). Multi-task feature learning. In B. Schöölkopf, J. Platt and T. Hoffman (Eds.), NIPS 19. Cambridge, MA: MIT Press.Google Scholar
- Boyd, S., & Vandenberghe, L. (2004). Convex optimization. Cambridge University Press. Google ScholarDigital Library
- Changizi, M., & Shimojo, S. (2005). Character complexity and redundancy in writing systems over human history. Proc Biol Sci. 2005 Feb 7;272(1560):267--75.Google ScholarCross Ref
- Crammer, K., Keshet, J., & Singer, Y. (2002). Kernel design using boosting.Google Scholar
- Crammer, K., & Singer, Y. (2001). On the algorithmic implementation of multiclass kernel-based vector machines. JMLR. Google ScholarDigital Library
- Dekel, O., Keshet, J., & Singer, Y. (2004). Large margin hierarchical classification. Proceedings of the ICML. Google ScholarDigital Library
- Dekel, O., Shalev-Shwartz, S., & Singer, Y. (2003). Smooth epsilon-insensitive regression by loss symmetrization. Proceedings of the Sixteenth Annual COLT.Google Scholar
- Fazel, M., Hindi, H., & Boyd, S. (2001). A rank minimization heuristic with application to minimum order system approximation. Proceedings American Control Conference.Google ScholarCross Ref
- Fink, M., Shalev-Schwartz, S., Singer, Y., & Ullman, S. (2006). Multiclass online learning by interclass hypotheseis sharing. Proceedings of ICML. Google ScholarDigital Library
- Fink, M., & Ullman, S. (2007). From aardvark to zorro: A benchmark of mammal images.Google Scholar
- Srebro, N., Rennie, J., & Jaakkola, T. (2005). Maximum margin matrix factorization. Advances in NIPS, 17.Google Scholar
- Thrun, S. (1996). Learning to learn: Introduction. Kluwer Academic Publishers.Google Scholar
- Torralba, A., Murphy, K., & Freeman, W. (2004). Sharing visual features for multiclass and multiview object detection. CVPR. Google ScholarDigital Library
- Zhang, J., Marszalek, M., Lazebnik, S., & Schmid, C. (2006). Local features and kernels for classification of texture and object categories: A comprehensive study. CVPR Workshop. Google ScholarDigital Library
- Zhang, T., & Oles, F. J. (2001). Text categorization based on regularized linear classification methods. Information Retrieval Google ScholarDigital Library
- Uncovering shared structures in multiclass classification
Recommendations
Multiclass penalized likelihood pattern classification algorithm
ICONIP'12: Proceedings of the 19th international conference on Neural Information Processing - Volume Part IIIPenalized likelihood is a general approach whereby an objective function is defined, consisting of the log likelihood of the data minus some term penalizing non-smooth solutions. Subsequently, this objective function is maximized, yielding a solution ...
Multiclass support matrix machine for single trial EEG classification
We propose a novel multiclass classifier for single trial electroencephalogram (EEG) data in matrix form, namely multiclass support matrix machine (MSMM), aiming at improving the classification accuracy of multiclass EEG signals, and hence enhancing the ...
Improving multiclass classification using neighborhood search in error correcting output codes
A novel multiclass classification method based on ECOC framework is introduced.It takes advantage of both problem-dependent and problem-independent approaches.It tries to exclude unrelated classifier vote to achieve the final code word.Proposed method ...
Comments