Support Vector Data Description

Tax, David M.J.; Duin, Robert P.W.

doi:10.1023/B:MACH.0000008084.60811.49

Support Vector Data Description

Published: January 2004

Volume 54, pages 45–66, (2004)
Cite this article

Download PDF

Machine Learning Aims and scope Submit manuscript

Support Vector Data Description

Download PDF

David M.J. Tax¹ &
Robert P.W. Duin

24k Accesses
2289 Citations
15 Altmetric
Explore all metrics

Abstract

Data domain description concerns the characterization of a data set. A good description covers all target data but includes no superfluous space. The boundary of a dataset can be used to detect novel data or outliers. We will present the Support Vector Data Description (SVDD) which is inspired by the Support Vector Classifier. It obtains a spherically shaped boundary around a dataset and analogous to the Support Vector Classifier it can be made flexible by using other kernel functions. The method is made robust against outliers in the training set and is capable of tightening the description by using negative examples. We show characteristics of the Support Vector Data Descriptions using artificial and real data.

References

Barnett, V., & Lewis, T. (1978). Outliers in Statistical Data, 2nd ed. Wiley series in probability and mathematical statistics. John Wiley & Sons Ltd.
Bishop, C. (1994). Novelty detection and neural network validation. IEE Proceedings on Vision, Image and Signal Processing. Special Issue on Applications of Neural Networks, 141:4, 217–222.
Google Scholar
Bishop, C. (1995). Neural Networks for Pattern Recognition. Oxford University Press, Walton Street, Oxford OX2 6DP.
Google Scholar
Blake, C., Keogh, E., & Merz, C. (1998). UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html, University of California, Irvine, Dept. of Information and Computer Sciences.
Bradley, A. (1997). The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30:7, 1145–1159.
Google Scholar
Duin, R. (1976). On the choice of the smoothing parameters for Parzen estimators of probability density functions. IEEE Transactions on Computers, C-25:11, 1175–1179.
Google Scholar
Japkowicz, N., Myers, C., & Gluck, M. (1995). A novelty detection approach to classification. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (pp. 518–523).
Koch, M., Moya, M., Hostetler, L., & Fogler, R. (1995). Cueing, feature discovery and one-class learning for synthetic aperture radar automatic target recognition. Neural Networks, 8:7/8, 1081–1102.
Google Scholar
MacKay, D. (1992). Bayesian methods for adaptive models. Master's thesis, California Institute of Technology, Pasadena, California.
Google Scholar
Metz, C. (1978). Basic principles of ROC analysis. Seminars in Nuclear Medicine, VIII:4.
Moya, M., & Hush, D. (1996). Network contraints and multi-objective optimization for one-class classification. Neural Networks, 9:3, 463–474.
Google Scholar
Moya, M., Koch, M., & Hostetler, L. (1993). One-class classifier networks for target recognition applications. In Proceedings World Congress on Neural Networks (pp. 797–801). Portland, OR: International Neural Network Society.
Google Scholar
Parra, L., Deco, G., & Miesbach, S. (1996). Statistical independence and novelty detection with information preserving nonlinear maps. Neural Computation, 8, 260–269.
Google Scholar
Richard, M., & Lippmann, R. (1991). Neural network classifiers estimate Bayesian a posteriori probabilities. Neural Computation, 3, 461–483.
Google Scholar
Ripley, B. (1996). Pattern Recognition and Neural Networks. Cambridge: Cambridge University Press.
Google Scholar
Ritter, G., & Gallegos, M. (1997). Outliers in statistical pattern recognition and an application to automatic chromosome classification. Pattern Recognition Letters, 18, 525–539.
Google Scholar
Roberts, S., Tarassenko, L., Pardey, J., & Siegwart, D. (1994). A validation index for artificial neural networks. In Proceedings of Int. Conference on Neural Networks and Expert Systems in Medicine and Healthcare (pp. 23–30).
Roberts, S., & Penny, W. (1996). Novelty, confidence and errors in connectionist systems. Tech. rep., Imperial College, London. TR-96-1.
Google Scholar
Rosen, J. (1965). Pattern seperation by convex programming. Journal of Mathematical Analysis and Applications, 10:1, 123–134.
Google Scholar
Schölkopf, B. (1997). Support vector learning. Ph.D. thesis, Technischen Universität Berlin.
Schölkopf, B., Burges, C., & Vapnik, V. (1995). Extracting support data for a given task. In U. Fayyad, & R. Uthurusamy (eds.), Proc. of First International Conference on Knowledge Discovery and Data Mining (pp. 252–257). Menlo Park, CA. AAAI Press.
Google Scholar
Schölkopf, B., Platt, J., Shawe-Taylor, J., A., S., & Williamson, R. (1999a). Estimating the support of a high-dimensional distribution. Neural Computation, 13:7.
Schölkopf, B., Williamson, R., Smola, A., & Shawe-Taylor, J. (1999b). SV estimation of a distribution's support. In Advances in Neural Information Processing Systems.
Smola, A., Schölkopf, B., & Müller, K. (1998). The connection between regularization operators and support vector kernels. Neural Networks, 11, 637–649.
Google Scholar
Tarassenko, L., Hayton, P., & Brady, M. (1995). Novelty detection for the identification of masses in mammograms. In Proc. of the Fourth International IEE Conference on Artificial Neural Networks Vol. 409 (pp. 442–447).
Google Scholar
Tax, D., Ypma, A., & Duin, R. (1999). Support vector data description applied to machine vibration analysis. In M. Boassen, J. Kaandorp, J. Tonino, & V. M.G. (eds.), Proceedings of the Fifth Annual Conference of the ASCI (pp. 398–405).
Tax, D., & Duin, R. (1999). Support vector domain description. Pattern Recognition Letters, 20:11–13, 1191–1199.
Google Scholar
Tax, D., & Duin, R. (2000). Data descriptions in subspaces. In Proceedings of the International Conference on Pattern Recognition 2000, Vol. 2 (pp. 672–675).
Google Scholar
Vapnik, V. (1998). Statistical Learning Theory. Wiley.

Download references

Author information

Authors and Affiliations

Pattern Recognition Group, Faculty of Applied Sciences, Delft University of Technology, Lorentzweg 1, 2628, CJ Delft, The Netherlands
David M.J. Tax

Authors

David M.J. Tax
View author publications
You can also search for this author in PubMed Google Scholar
Robert P.W. Duin
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tax, D.M., Duin, R.P. Support Vector Data Description. Machine Learning 54, 45–66 (2004). https://doi.org/10.1023/B:MACH.0000008084.60811.49

Download citation

Issue Date: January 2004
DOI: https://doi.org/10.1023/B:MACH.0000008084.60811.49

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Support Vector Data Description

Abstract

Article PDF

Similar content being viewed by others

Ellipsoidal support vector data description

Parallel Support Vector Data Description

A New Fuzzy Support Vector Data Description Machine

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Support Vector Data Description

Abstract

Article PDF

Similar content being viewed by others

Ellipsoidal support vector data description

Parallel Support Vector Data Description

A New Fuzzy Support Vector Data Description Machine

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation