skip to main content
10.1145/1636682.1636693acmotherconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Free Access

Early traffic classification using support vector machines

Published:24 September 2009Publication History

ABSTRACT

Internet traffic classification is an essential task for managing large networks. Network design, routing optimization, quality of service management, anomaly and intrusion detection tasks can be improved with a good knowledge of the traffic.

Traditional classification methods based on transport port analysis have become inappropriate for modern applications. Payload based analysis using pattern searching have privacy concerns and are usually slow and expensive in computational cost.

In recent years, traffic classification based on the statistical properties of flows has become a relevant topic. In this work we analyze the size of the firsts packets on both directions of a flow as a relevant statistical fingerprint. This fingerprint is enough for accurate traffic classification and so can be useful for early traffic identification in real time.

This work proposes the use of a supervised machine learning clustering method for traffic classification based on Support Vector Machines. We compare our method accuracy with a more classical centroid based approach, obtaining promising results.

References

  1. Application layer packet classifier for linux (l7-filter), http://17-filter.sourceforge.net/.Google ScholarGoogle Scholar
  2. L. Bernaille, R. Teixeira, I. Akodkenou, A. Soule, and K. Salamatian. Traffic classification on the fly. SIGCOMM Comput. Commun. Rev., 36(2):23--26, April 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Bernaille, R. Teixeira, and K. Salamatian. Early application identification. In CoNEXT '06: Proceedings of the 2006 ACM CoNEXT conference, pages 1--12, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.Google ScholarGoogle Scholar
  5. N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, March 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Crotti, M. Dusi, F. Gringoli, and L. Salgarelli. Traffic classification through simple statistical fingerprinting. SIGCOMM Comput. Commun. Rev., 37(1):5--16, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. W. Hsu, C. C. Chang, and C. J. Lin. A practical guide to support vector classification. Technical report, Taipei, 2003.Google ScholarGoogle Scholar
  8. N.-F. Huang, G.-Y. Jai, and H.-C. Chao. Early identifying application traffic with application characteristics. pages 5788--5792, May 2008.Google ScholarGoogle Scholar
  9. T. Karagiannis, K. Papagiannaki, and M. Faloutsos. Blinc: multilevel traffic classification in the dark. SIGCOMM Comput. Commun. Rev., 35(4):229--240, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. H.-C. Kim, K. Claffy, M. Fomenkov, D. Barman, M. Faloutsos, and K. Lee. Internet traffic classification demystified: Myths, caveats, and the best practices. In ACM CoNEXT 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Z. Li, R. Yuan, and X. Guan. Accurate classification of the internet traffic based on the svm method. pages 1373--1378, June 2007.Google ScholarGoogle Scholar
  12. J. Ma, K. Levchenko, C. Kreibich, S. Savage, and G. M. Voelker. Unexpected means of protocol inference. In IMC '06: Proceedings of the 6th ACM SIGCOMM conference on Internet measurement, pages 313--326, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Mcgregor, M. Hall, P. Lorier, and J. Brunskill. Flow clustering using machine learning techniques. In In PAM, pages 205--214, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  14. A. Moore and K. Papagiannaki. Toward the Accurate Identification of Network Applications. In Proceedings of the Passive y Active Measurement Workshop (PAM2005), March/Apri 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. W. Moore and D. Zuev. Internet traffic classification using bayesian analysis techniques. SIGMETRICS Perform. Eval. Rev., 33(1):50--60, June 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Roughan, S. Sen, O. Spatscheck, and N. Duffield. Class-of-service mapping for qos: A statistical signature-based approach to ip traffic classification. In In IMCŠ04, pages 135--148, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Sen, O. Spatscheck, and D. Wang. Accurate, scalable in-network identification of p2p traffic using application signatures. In WWW '04: Proceedings of the 13th international conference on World Wide Web, pages 512--521, New York, NY, USA, 2004. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Valenti, D. Rossi, M. Meo, M. Mellia, and P. Bermolen. A behavioral classification framework for p2p-tv applications. Technical Report WP3.1, TELECOM ParisTech (France), Politecnico di Torino (Italy), January 2009.Google ScholarGoogle Scholar
  19. V. N. Vapnik. The nature of statistical learning theory. Springer-Verlag New York, Inc., New York, NY, USA, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. D. A. M. Wei Li, Kaysar Abdin. Approaching real-time network traffic classification. Technical Report RR-06-12, Department of Computer Science, Queen Mary, University of London, Mile End Road, London E1 4NS, UK, October 2006.Google ScholarGoogle Scholar
  21. Y. xiang Yang, R. Wang, Y. Liu, S. zhen Li, and X. yong Zhou. Solving p2p traffic identification problems via optimized support vector machines. In AICCSA {21}, pages 165--171.Google ScholarGoogle Scholar

Index Terms

  1. Early traffic classification using support vector machines

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        LANC '09: Proceedings of the 5th International Latin American Networking Conference
        September 2009
        108 pages
        ISBN:9781605587752
        DOI:10.1145/1636682

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 September 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader