skip to main content
10.1145/1626195.1626228acmconferencesArticle/Chapter ViewAbstractPublication PagessinConference Proceedingsconference-collections
research-article

DFA-based and SIMD NFA-based regular expression matching on cell BE for fast network traffic filtering

Authors Info & Claims
Published:06 October 2009Publication History

ABSTRACT

Regular expression matching is the heart of many data processing routines, such as string search, network traffic filtering, etc. The traditional way of regexp matching is building and execution of a deterministic finite automaton (DFA), that provides O(1) processing time per 1 input symbol for any regular expression. But this technique almost always forces many modern SIMD-processors to perform regexp search in scalar mode, thus it doesn't use the most part of their computational power.

This paper represents traditional straightforward DFA along with another regexp implementation, based on nondeterministic finite automata (NFA) SIMD-simulation on Cell Broadband Engine processor. Software implementation of NFA-based SIMD algorithm achieves as much as 10 Gbit/s per one Cell BE processor with 512 NFA states, thus it is feasible for preliminary network traffic filtering of suspicious objects, while DFA-based scalar one gains up to 60 Gbit/s with 60-state automaton. One Cell BE processor can maintain NFA of 6000..7000 overall states simultaneously, so if one wants to use signatures with more that 512 states, it's possible with linear performance-to-signatures tradeoff.

References

  1. Aho A. V., Corasick M. J. 1975 Efficient string matching: An aid to bibliographic search, Comm. of ACM, 18(6):333--340, 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Thompson K. 1968 Regular expression search algorithm. Communications of the ACM 11(6) (June 1968), pp. 419--422. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. AbuHmed T. 2007 Deep Packet Inspection for Intrusion Detection Systems: A Survey. Technical Report. Information Security Research Laboratory, Inha University, Incheon 402--751, Korea.Google ScholarGoogle Scholar
  4. Cox R. 2007 Regular Expression Matching Can Be Simple And Fast. http://swtch.com/~rsc/regexp/Google ScholarGoogle Scholar
  5. Kumar S., Yu F., Crowley P., Turner J. 2006. Algorithms to Accelerate Multiple Regular Expressions Matching for Deep Packet Inspection. In proc. of SIGCOMM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Kumar S., Turner J., Williams J. 2006 Advanced Algorithms for Fast and Scalable Deep Packet Inspection. In proc. of ANCS-2006, San Jose, California, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Kumar S., Turner J., Varghese G. 2007 Curing Regular Expressions Matching Algorithms from Insomnia. Amnesia, and Acalculia. In proc. of ANCS, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Smith R., Estan C., Jha S. 2008 XFA: Faster Signature Matching With Extended Automata. In IEEE Simposium on Security and Privacy, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Goyal N., Ormont J., Smith R., Estan C. 2008 Signature Matching in Network Processing Using SIMD/GPU Architectures. Technical Report #1628. University of Wisconsin, Madison.Google ScholarGoogle Scholar
  10. Ficara D., Giordano S., Procissi G. 2008 An Improved DFA for Fast Regular Expression Matching.--ACM SIGCOMM Computer Communication Review, Vol. 38, Number 5, 2008 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 2007 Cell Broadband Engine Programming Handbook (version 1.1). www.ibm.comGoogle ScholarGoogle Scholar
  12. 2007 Synergistic Processor Unit Instruction Set Architecture (version1.2). www.ibm.comGoogle ScholarGoogle Scholar
  13. Arevalo A. et al. 2008 Programming the Cell Broadband Engine Examples and Best Practices. IBM Redbook SG24-7575-00. www.ibm.comGoogle ScholarGoogle Scholar
  14. Yu F., Chen Z., Diao Y. 2006 Fast and Memory-Efficient Regular Expression Matching for Deep Packet Inspection. Technical report, University of California at BerkeleyGoogle ScholarGoogle Scholar
  15. Aho A. V., Sethi, R., Ullman, J. D. "Compilers : Principles, Techniques, and Tools". Addison-Wesley, 1985. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Scarpazza D. P., Villa O., Petrini F. 2008 Exact Multi-pattern String Matching on the Cell/B.E. Processor. CF'08, May 5--7, 2008, Ischia, Italy Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Navarro G. 2003 Pattern Matching. Technical Report. Department of Computer Science, University of Chile.Google ScholarGoogle Scholar
  18. Baeza-Yates R., Gonnet R. 1992 A new approach to text searching. Communications of the ACM, 35(10):74{82), 1992 Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Snort--the open source IDS, www.snort.orgGoogle ScholarGoogle Scholar

Index Terms

  1. DFA-based and SIMD NFA-based regular expression matching on cell BE for fast network traffic filtering

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIN '09: Proceedings of the 2nd international conference on Security of information and networks
        October 2009
        322 pages
        ISBN:9781605584126
        DOI:10.1145/1626195

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 6 October 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate102of289submissions,35%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader