skip to main content
10.1145/3239283.3239317acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsitConference Proceedingsconference-collections
research-article

A data reduction approach using hyperspherical sectors for support vector machine

Authors Info & Claims
Published:20 July 2018Publication History

ABSTRACT

In this paper, a new geometrical approach is proposed for data reduction to speed up the training of support vector machine (SVM) for large data sets. In this approach, the largest possible hyperspherical sector centered at each pattern and containing patterns of the same class is constructed, then the patterns that tend to be far away from the decision boundary are removed from the training set. Experiments show the effectiveness of the proposed approach in speeding up the training process while maintaining the same level of accuracy as the standard SVM.

References

  1. Angiulli, F. 2007. Fast nearest neighbor condensation for large data sets classification. IEEE Transactions On Knowledge And Data Engineering 19, 11 (November 2007) 1450--1464. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Cervantes, J., Li, X., and Yu, W. 2006. Support vector machine classification based on fuzzy clustering for large datasets. In: MICAI 2006: Advances in Artificial Intelligence, Lecture Notes in Computer Science, 4293, Springer, (2006), 572--582. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chang, C.C., and Lin, C.J., 2011. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 27:1--27:27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chien, L.J., Chang, C. C., and Lee, Y. J. 2010. Variant methods of reduced set selection for reduced support vector machines, Journal of Information Science and Engineering 26, 1, (2010), 183--196.Google ScholarGoogle Scholar
  5. Koggalage, R., and Halgamuge, S. 2004. Reducing the number of training samples for fast support vector machine classification. Neural Information Processing - Letters and Reviews 2, (March 2004), 57--65.Google ScholarGoogle Scholar
  6. Kremer, J., Pedersen, K. S., and Igel, C. 2014. Active learning with support vector machines. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 4,4 (July 2014), 313--326. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Lee, Y. J., and Mangasarian, O. L. 2001. RSVM: Reduced support vector machines. In Proceedings of the First SIAM International Conference on Data Mining (April 2001).Google ScholarGoogle Scholar
  8. Schohn, G., and D. Cohn, D. 2000. Less is more: Active learning with support vector machines. In Proceedings of the International Conference on Machine Learning (ICML), 839--846. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Shen, X., Mu, L., Li, Z., Wu, H., Gou, J., and Chen, X. 2016. Large-scale support vector machine classification with redundant data reduction. Neurocomputing 172 (January 2016), 189--197. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Tsang, I. W., Kwok, J. T., and Cheung, P. M. 2005. Core Vector Machines: Fast SVM Training on Very Large Data Sets. Journal of Machine Learning Research 6 (April 2005) 363--392. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Vapnik, V. 1998. Statistical Learning Theory. Wiley, NY.Google ScholarGoogle Scholar
  12. Wang, J., Neskovic, P., and Cooper, L. N. 2005. Training Data Selection for Support Vector Machines. In Proceedings of the First international conference on Advances in Natural Computation (Changsha, China, Au gust 27--29, 2005), 554--564. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A data reduction approach using hyperspherical sectors for support vector machine

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      DSIT '18: Proceedings of the 2018 International Conference on Data Science and Information Technology
      July 2018
      174 pages
      ISBN:9781450365215
      DOI:10.1145/3239283

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 July 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      DSIT '18 Paper Acceptance Rate31of85submissions,36%Overall Acceptance Rate114of277submissions,41%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader