
Extreme re-balancing for SVMs: a case study

Published: 01 June 2004

Abstract

There are many practical applications where learning from single-class examples is either the only possible solution or has a distinct performance advantage. The first case occurs when obtaining examples of a second class is difficult, e.g., classifying sites of "interest" based on web accesses. The second situation is exemplified by the gene knock-out experiments for understanding the Aryl Hydrocarbon Receptor signalling pathway that provided the data for the second task of the KDD 2002 Cup, where minority one-class SVMs significantly outperform models learnt using examples from both classes.

This paper explores the limits of supervised learning of a two-class discrimination from data with heavily unbalanced class proportions. We focus on the case of supervised learning with support vector machines. We consider the impact of both sampling and weighting imbalance-compensation techniques, and then extend the balancing to extreme situations where one of the classes is ignored completely and the learning is accomplished using examples from a single class.

Our investigation with the data for the KDD 2002 Cup, as well as text benchmarks such as the Reuters Newswire, shows that there is a consistent pattern of performance differences between one- and two-class learning for all SVMs investigated, and these patterns persist even with aggressive dimensionality reduction through automated feature selection. Using insight gained from the above analysis, we generate synthetic data showing a similar pattern of performance.
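The weighting compensation mentioned in the abstract can be sketched as follows: each class receives a weight inversely proportional to its frequency, and that weight scales the class's contribution to the SVM hinge loss, so errors on the minority class cost more. This is a minimal, hedged illustration of the general idea; the function names, the n/(2·count) weighting scheme, and the toy data are illustrative assumptions, not taken from the paper's experiments.

```python
# Illustrative sketch (not the paper's implementation): class-frequency
# weights plugged into a linear-SVM hinge loss.

def class_weights(labels):
    """n / (2 * class_count) per class: rarer class gets a larger weight."""
    n = len(labels)
    pos = sum(1 for y in labels if y == +1)
    neg = n - pos
    return {+1: n / (2.0 * pos), -1: n / (2.0 * neg)}

def weighted_hinge_loss(w, b, xs, ys, weights):
    """Mean of weights[y] * max(0, 1 - y * (w . x + b)) over the sample."""
    total = 0.0
    for x, y in zip(xs, ys):
        margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
        total += weights[y] * max(0.0, 1.0 - margin)
    return total / len(xs)

# One minority positive among four examples: its errors are weighted 2.0,
# each majority error only 2/3, so both classes contribute equally in total.
ys = [+1, -1, -1, -1]
cw = class_weights(ys)
print(cw[+1], round(cw[-1], 4))  # → 2.0 0.6667
```

With the trivial zero classifier (w = 0, b = 0) every margin is zero and every hinge term is 1, so the weighted loss over the toy sample is (2.0 + 3 · 2/3) / 4 = 1.0: the single minority example now accounts for half of the total loss instead of a quarter.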



  • Published in

    ACM SIGKDD Explorations Newsletter, Volume 6, Issue 1
    Special issue on learning from imbalanced datasets
    June 2004, 117 pages
    ISSN: 1931-0145
    EISSN: 1931-0153
    DOI: 10.1145/1007730
    Copyright © 2004 Authors
    Publisher: Association for Computing Machinery, New York, NY, United States